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Introduction  to  ARL 


The  Army  Research  Laboratory  of  the  U.S.  Army  Research,  Development  and  Engineering  Command  (RDECOM)  is  the  Army's 
corporate  laboratory.  ARL's  research  continuum  focuses  on  basic  and  applied  research  (6.1  and  6.2)  and  survivability/lethality 
and  human  factors  analysis  (6.6).  ARL  also  applies  the  extensive  research  and  analysis  tools  developed  in  its  direct  mission 
program  to  support  ongoing  development  and  acquisition  programs  in  the  Army  Research,  Development  and  Engineering 
Centers  (RDECs),  Program  Executive  Offices  (PEOs)/Program  Manager  (PM)  Offices,  and  Industry.  ARL  has  consistently  provided 
the  enabling  technologies  in  many  of  the  Army's  most  important  weapons  systems. 

The  Soldiers  of  today  and  tomorrow  depend  on  us  to  deliver  the  scientific  discoveries,  technological  advances,  and  the  analyses 
that  provide  Warfighters  with  the  capabilities  to  execute  full-spectrum  operations.  ARL  has  Collaborative  Technology  Alliances  in 
Micro  Autonomous  Systems  and  Technology,  Robotics,  Cognition  and  Neuroergonomics,  and  Network  Science,  an  International 
Technology  Alliance,  and  new  Collaborative  Research  Alliances  in  Multiscale  Multidisciplinary  Modeling  of  Electronic  Materials 
and  Materials  in  Extreme  Dynamic  Environments.  ARL's  diverse  assortment  of  unique  facilities  and  dedicated  workforce  of 
government  and  private  sector  partners  make  up  the  largest  source  of  world  class  integrated  research  and  analysis  in  the  Army. 

ARL  Mission 

The  mission  of  ARL  is  to  "Provide  the  underpinning  science,  technology,  and  analysis  that  enable  full-spectrum  operations." 
Our  Vision 

America's  Laboratory  for  the  Army:  Many  Minds,  Many  Capabilities,  Single  Focus  on  the  Soldier 
ARLs  Organization 

•Army  Research  Office  (ARO)  -  Initiates  the  scientific  and  far  reaching  technological  discoveries  in  extramural  organizations: 
educational  institutions,  nonprofit  organizations,  and  private  industry. 

•Computational  and  Information  Sciences  Directorate  (CISD)  -  Scientific  research  and  technology  focused  on  information 
processing,  network  and  communication  sciences,  information  assurance,  battlespace  environments,  and  advanced  computing 
that  create,  exploit,  and  harvest  innovative  technologies  to  enable  knowledge  superiority  for  the  Warfighter. 

•  Human  Research  and  Engineering  Directorate  (HRED)  -  Scientific  research  and  technology  directed  toward  optimizing  Soldier 
performance  and  Soldier-machine  interactions  to  maximize  battlefield  effectiveness  and  to  ensure  that  Soldier  performance 
requirements  are  adequately  considered  in  technology  development  and  system  design. 

•Sensors  and  Electron  Devices  Directorate  (SEDD)  -  Scientific  research  and  technology  in  electro-optic  smart  sensors, 
multifunction  radio  frequency  (RF),  autonomous  sensing,  power  and  energy,  and  signature  management  for  reconnaissance, 
intelligence,  surveillance,  target  acquisition  (RISTA),  fire  control,  guidance,  fuzing,  survivability,  mobility,  and  lethality. 

•Survivability/Lethality  Analysis  Directorate  (SLAD)  -  Integrated  survivability  and  lethality  analysis  of  Army  systems  and 
technologies  across  the  full  spectrum  of  battlefield  threats  and  environments  as  well  as  analysis  tools,  techniques,  and 
methodologies. 

•  Vehicle  Technology  Directorate  (VTD)  -  Scientific  research  and  technology  addressing  propulsion,  transmission,  aeromechanics, 
structural  engineering,  and  robotics  technologies  for  both  air  and  ground  vehicles. 

•Weapons  and  Materials  Research  Directorate  (WMRD)  -  Scientific  research  and  technology  in  the  areas  of  weapons,  protection, 
and  materials  to  enhance  the  lethality  and  survivability  of  the  Nation's  ground  forces. 

ARL  Workforce  in  2013 

•  1,971  Civilians  -  41  Military 

•  1,181  Research  Performing  Workforce 

•  578  (49%)  hold  PhDs 

•  13  STs/ 26  ARL  Fellows 

ARL's  Primary  Sites 

•  Aberdeen  Proving  Ground,  MD 

•  Adelphi  Laboratory  Center,  MD 

•  White  Sands  Missile  Range,  NM 

•  Raleigh-Durham,  NC 

•  Orlando,  FL 

Visit  ARLs  web  site  at  www.arl.army.mil 


provide  our  scientists  and  engineers  access  to 
world-class  research  centers. 


ARL 

U.S.  ARMY  RESEARCH  LABORATORY' 
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FOREWORD 

Welcome  to  the  second  edition  of  the  Research  @ARL  monograph  series.  In  this 
edition,  we  share  ARL's  leading  edge  research  in  the  Network  Sciences.  Today's 
society  is  more  networked  than  ever  before,  and  today's  Army  reflects  that  reality. 
However,  the  tools  needed  to  understand  the  science  of  our  interconnected  global 
society  are  not  yet  fully  developed.  Many  of  the  networks  that  touch  our  everyday 
lives,  both  in  the  Army  and  outside,  are  complex  networks.  They  are  often  governed 
by  local  interactions  and  robust  self-organization  principles,  leading  to  complex 
dynamics.  And  networks  do  not  live  in  isolation;  they  are  interconnected  in  surprising 
ways.  A  modern  Army  cannot  function— any  more  than  a  modern  city  can— without  the 
latticework  of  transportation  systems,  utilities,  water  and  food  distribution  systems, 
communication  webs,  health  care  delivery  organizations,  and  a  myriad  of  other 
interconnected,  complex  networks.  These  layers  of  interpenetrating  networks  paint 
a  picture  of  today's  Army  that  overarches  the  traditional  physical  sciences,  social 
sciences,  life  sciences,  and  information  sciences.  No  one  discipline  or  collection  of 
disciplines  can  fully  embrace  the  spectrum  of  complex  problems  that  the  engineered 
networking  of  humanity  presents. 

ARL's  research  program  in  the  Network  Sciences  capitalizes  on  ARL's  expertise  in  basic  and  applied  research,  ranging  across 
a  multitude  of  science  areas,  while  simultaneously  drawing  upon  the  talents  of  others,  nationally  and  internationally,  through 
collaborative  alliances  with  industry,  laboratories,  and  academia.  We  seek  to  discover  the  underlying  mathematical  principles 
and  universal  laws  that  govern  the  behavior  of  co-evolving  multi-genre  networks.  We  also  seek  to  understand  emergent  behavior 
in  such  networks,  leading  to  mechanisms  for  thwarting  or  accelerating  the  emergence  of  interesting  behaviors.  Our  research 
teams  are  examining  the  complex  neurological  networks  within  the  human  brain  to  determine  how  individuals  make  decisions 
and  perform  other  cognitive  tasks  under  varying  levels  of  stress  and  uncertainty.  The  insight  gained  from  this  research  is 
being  applied  to  other  biological  networks  to  alleviate  the  injuries  and  pathologies  encountered  by  the  Warfighter.  Exploration 
of  cooperative  behavior  of  social  networks,  such  as  swarms,  is  a  significant  research  opportunity  to  control  the  mobility  of 
collections  of  autonomous  platforms.  In  addition  to  networks  in  the  life  and  social  sciences,  ARL  teams  are  examining  the 
connectedness  within  and  between  communications  and  information  networks  to  determine  how  their  complex  structures  can 
best  facilitate  the  gathering  of  intelligence  and  the  formation  of  collective  decisions. 

As  we  investigate  the  fundamental  science  addressing  these  critical  applications,  we  search  for  laws  and  principles  of  Network 
Science  that  parallel  and  compliment  those  in  physics.  Tentative  principles  are  presently  guiding  the  development  of  the 
mathematical  infrastructure  necessary  for  understanding  those  aspects  of  complex  networks  common  to  multiple  phenomena 
appearing  in  a  variety  of  scientific  disciplines. 

ARL's  long  term  commitment  to  encompass  interdisciplinary  sciences  in  such  diverse  areas  as  vulnerability  assessment, 
cognitive  sciences,  human  sciences,  electronic  devices,  weapons  and  materials  sciences,  information  sciences,  survivability, 
and  lethality,  uniquely  positions  ourteams  to  investigate  and  discover  previously  unrealized  commonalities  to  provide  innovative 
solutions.  Our  discoveries  enhance  our  country's  technological  capabilities,  our  national  security,  and,  ultimately,  enable  the 
invincible  Soldier  of  the  future. 


Dr.  John  Pellegrino 

Director  (A),  U.S.  Army  Research  Laboratory 
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Introduction  to  ARL  Research  in  Network  Sciences 
BJ.West,  A.  Swami,  J.N.  Mait,  and  P.J.  Franaszczuk 

1  Army  Research  Laboratory:  Network  Science  Program 

The  objective  of  the  Network  Science  Program  of  the  U.S.  Army  Research  Laboratory  (ARL)  is  to  perform  foundational  research 
on  network  science  (NS)  leading  to  a  fundamental  understanding  of  the  interplay  within  and  among  the  physical,  social/ 
cognitive,  information,  and  communication  networks.  This  research  is  expected  to  lead  to  insights  on  how  processes  and 
parameters  in  one  network  affect  and  are  affected  by  those  in  other  networks.  The  underlying  long-term  goal  is  to  optimize 
human  performance  and  to  greatly  enhance  speed  and  precision  for  complex  networked  military  operations. 

We  define  Network  Science  as  the  study  of  complex  systems  whose  behavior  and  responses  are  determined  by  exchanges 
and  interactions  between  subsystems  across  a  possibly  dynamic  and  usually  poorly  defined  set  of  pathways.  The  fundamental 
components  of  a  network  are  its  structure  (nodes  and  links  or  pathways)  and  its  dynamics,  which  together  specify  the  network's 
properties  (functions  and  behaviors). 

The  Army  is  embarked  upon  an  information  age  transformation  loosely  referred  to  as  Network  Centric  Warfare  (NCW)  or 
Network  Centric  Operations  (NCO).  NCW/NCO  seeks  to  dramatically  increase  mission  effectiveness  via  robust  networking  for 
information  sharing  leading  to  shared  situational  awareness,  improved  collaboration  and  self-synchronization,  and  enhanced 
sustainability  and  speed  of  command  [1,  2].  Central  to  this  effort  is  an  understanding  of  the  interaction  of  networks  in  the 
physical,  informational,  cognitive,  and  social  domains;  thus,  ARLs  NS  program  directly  supports  NCW/NCO. 

Society,  as  pointed  out  in  the  2005  National  Research  Council  (NRC)  report  Network  Science  [3],  is  more  interconnected 
than  it  has  been  at  any  time  in  world  history.  A  modern  city  could  not  function  without  garbage  collection,  sewers,  electricity, 
transportation,  water,  health  care,  and  food  and  fuel  distribution,  and  would  have  a  much  different  form  without  networks  of 
education,  banking,  telephone  service,  and  the  Internet.  Some  of  these  activities  form  local  physical  or  social  networks  within 
the  city  and  their  forms  have  been  evolving  for  millennia.  Part  of  that  evolution  was  the  development  of  their  interoperability  such 
that  these  networks  are  all  interconnected  and  in  one  way  or  another  they  connect  to  ever-expanding  global  networks.  These 
interconnections  can  lead  to  robustly  coevolving  networks,  but  under  some  conditions,  can  also  lead  to  cascading  failures.  This 
corresponds  to  the  first  finding  of  the  NRC  report:  "Networks  enable  the  necessities  and  conveniences  of  modern  life."  These 
are  the  engineered  networks  of  humanity,  but  there  are  comparable  structures  in  the  biosphere  and  ecosphere  involving  plant 
and  animal  networks  of  tremendous  variety. 

Moreover,  our  internal  world  is  also  an  interlacing  and  interacting  collection  of  networks.  The  neuronal  network  carrying  the 
brain's  signals  to  the  body's  physiological  networks  is  even  more  complex  than  the  modern  city.  The  bio/eco-networks  are 
certainly  as  difficult  to  understand  as  the  physical/social-networks.  This  led  to  another  finding  in  the  NRC  report:  "Social  and 
biological  networks  bear  important  similarities  to  engineered  networks."  Herein  we  present  a  number  of  studies  carried  out  by 
ARL  scientists  in  collaboration  with  the  broader  scientific  community  to  further  our  understanding  of  NS.  These  studies  suggest 
how  these  similarities  might  be  exploited  for  the  development  of  NS  that  is  of  value  to  society  in  general  and  to  the  United  States 
Army  in  particular. 

The  insights  and  mathematical  characterizations  resulting  from  ARLs  NS  research  will  enable  us  to  achieve  this  underlying 
goal,  by  providing  the  ability  to  predict  and  control  the  individual  and  composite  behavior  of  the  complex  interacting  physical, 
communications,  information,  and  social/cognitive  networks.  Network  science  is  one  of  ARLs  strategic  technology  investment 
areas,  with  a  strong  program  based  on  the  modern  research  pillars  of  theory,  simulation,  emulation,  and  experimentation.  It 
encompasses  a  variety  of  transdisciplinary  basic  and  applied  programs  across  ARLs  core  competency  areas  that  include  close 
collaborations  with  academic,  military,  and  industrial  research  partners  throughout  the  US  and  internationally.  These  include 
ARLs  Network  Science  Collaborative  Technology  Alliance  (OTA),  the  US-UK  Network  and  Information  Sciences  International 
Technology  Alliance  (ITA),  the  Micro  Autonomous  Systems  &  Technology  OTA,  the  Cognition  &  Neuroergonomics  OTA,  and  several 
Multidisciplinary  University  Research  Initiatives  (MURIs).  The  Institute  for  Collaborative  Biotechnology  is  a  University  Affiliated 
Research  Center  (UARC)  that  approaches  networks  from  the  perspective  of  biological  systems,  and  ARLs  Mobile  Network 
Modeling  Institute  (MNMI)  is  an  applied  research  effort  that  seeks  to  further  the  science  by  enabling  full  fidelity  at-scale 
modeling  and  simulation  of  large  scale  heterogeneous  networks,  incorporating  live  assets  in  the  emulation  environment.  More 
information  on  these  programs  may  be  found  by  navigating  ARLs  Collaborative  Technology  and  Research  Alliances  website  at 
http://www.arl. army.mil/www/default.cfm?page=93. 
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The  taxonomy  of  the  ARL  research  efforts  in  NS  can  take  a  number  of  forms,  two  of  which  are  given  here.  To  set  the  stage  for 
the  scientific  problems,  we  summarize  four  grand  challenges  and  identify  the  Army  programs  conducting  research  to  address 
these  challenges.  A  separate  and  distinct  way  of  organizing  the  research  is  via  the  network  category  addressed.  The  individual 
research  papers  are  identified  in  this  latter  grouping  according  to  discipline. 

2  Grand  Challenges 

The  ARL  research  effort  in  NS  seeks  to  make  fundamental  advances  of  and  contributions  to  the  science  so  as  to  enable  the  a 
priori  prediction  of  the  behaviors  of  complex  interacting  networks  in  diverse  and  dynamic  environments  and  an  understanding 
of  the  design  trade-offs  and  the  impacts  of  various  technologies  under  a  wide  variety  of  dynamic  and  adverse  conditions. 
Moreover,  it  is  intended  to  quantify  the  impact  of  network  technologies  both  technically  and  operationally  to  make  informed 
acquisition  decisions. 

2.1  Underpinning  Mattiematicai  Framework:  Dynamic  Muitipie-Scaie,  Cross-Genre  Coupiing 

Multiple-scale  and  cross-scale  couplings  within  and  across  networks  are  recurrent  themes  in  the  study  of  complex  physical, 
communication,  informational,  social,  and  biological  networks.  Existing  formalisms  address  one,  two,  or  a  continuum  of  scales, 
but  the  multi-node  problem  remains  unsolved.  Existing  mathematical  approaches  offer  insights  into  the  quantitative  aspects 
of  complex  phenomena  with  multiple  scales  (no  characteristic  scale)  in  stationary  and  near  equilibrium  networks,  but  what 
is  needed  now  is  a  way  to  describe  the  dynamics  of  such  complex  phenomena/networks  when  the  underlying  processes 
are  neither  stationary  nor  near  equilibrium  [4].  Network  characteristics  cannot  be  deduced  from  the  properties  of  individual 
components;  they  emerge  during  the  formation  and  growth  of  the  network.  Consequently  a  mathematical  framework  is  needed 
to  characterize  the  interactions  between  the  dynamic  network  components,  the  temporal  evolution  of  the  network,  and  its 
response  to  external  stimuli,  including  attacks.  The  framework  must  take  into  account  heterogeneity,  non-stationarity,  even 
non-ergodicity,  as  well  as  conflicting  constraints  and  objectives. 

One  way  to  characterize  the  interaction  of  networks  is  by  understanding  how  information  is  propagated  within  and  across 
complex  adaptive  networks  —  how  information,  in  a  general  sense,  flows  across  networks  so  as  to  maximize  utility  (e.g.,  as 
perceived  by  a  decision  maker)  under  constraints,  such  as  timeliness  and  resource  usage.  Tools  from  statistical  physics  such 
as  continuum  percolation  theory,  stochastic  geometry,  and  modern  statistics  are  used  to  deal  with  non-Markovian  behaviors, 
heavy-tailed  phenomena,  and  the  mutual  interactions  between  networks.  To  cope  with  multiple  potentially  conflicting  constraints 
and  objectives,  a  generalized  control  theory  of  locally  reacting  multiple  agents  needs  to  be  developed,  perhaps  through  the 
mimicking  of  self-repair  and  replication  identified  in  biological  networks. 

2.2  Human  Metrics  in  Compiex  Networks 

A  critical  element  in  realizing  the  vision  of  Network-Centric  Operations  (NCO)  is  that  the  distributed  decision  making  process 
must  take  into  account  the  human  dimension  in  its  elements  including  decision-makers,  data  and  information  sources, 
processed  information,  communication  network  elements,  etc.  The  measure  or  metric  of  the  human  dimension  must  be 
derived  in  a  distributed  fashion;  in  time-critical  and  stressful  situations;  in  environments  where  node  capture  and  subversion 
are  likely;  where  the  underlying  communications  network  is  resource-constrained,  mobile,  and  dynamic;  and  where  decision- 
makers' reliance  on  and  compliance  with  an  information  network  are  subject  to  numerous  internal  and  external  influences. 
Key  barriers  include  the  limited  understanding  of  network-of-networks,  composability  of  heterogeneous  networks,  and  a  limited 
understanding  of  how  the  context  (e.g.,  trust  and  risk)  is  analytically  derived  from  variously  distributed  sources  of  evidence. 

A  key  aspect  of  a  common  quantitative  framework  is  an  approach  for  representing  uncertainty  (probability  theory,  possibility 
theory,  evidence  theory,  subjective  logic,  argumentation  theory,  or  other).  Development  of  metrics  is  a  key  issue:  metrics  that 
take  into  account  the  value  of  information  and  human  intentions  are  important  in  information  and  C2  networks.  Relations 
between  and  among  these  metrics  need  to  be  clarified  along  with  the  development  of  a  mathematics  of  network  classes.  A 
specific  example  is  the  trust  metric.  The  scientific  challenge  is  to  understand  the  different  definitions  and  dimensions  of  trust, 
for  example,  in  socio-cognitive  and  communications  networks,  and  from  that  understanding  develop  a  composite  trust  metric. 
Rather  than  merely  treating  the  interaction  via  a  simple  interface  or  utility  function  offered  by  one  network  to  another,  network 
interactions  must  be  taken  into  account,  as  well  as  the  context-  and  time-varying  nature  of  the  trust  metric  components. 
Encapsulation  of  these  sub-components  as  a  scalar  metric  is  challenging  due  to  the  conflicting  nature  of  the  components  and 
the  non-convexity  of  useful  metrics.  To  quantify  human  performance  in  networks,  measurable  features  of  data  from  different 
networks  need  to  be  identified  to  enable  the  modeling  of  network  complexity  and  the  interactions  between  different  networks. 
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2.3  Abstracting  Common  Concepts  Across  Fields 

A  set  of  common  concepts  for  network  science  that  is  defined  consistently  across  the  various  scientific  disciplines  does  not 
exist.  There  are  discipline-specific  nomenclatures  that  have  been  developed  for  specialized  needs,  but  the  equivalence  of 
these  terminologies  for  a  discipline-independent  characterization  of  networks  has  not  been  established.  The  mathematical 
formulation  of  these  empirically-based  concepts  will  provide  a  nascent  language  for  network  science. 

Mathematical  rigor  across  the  component  disciplines  is  promoted  by  focusing  on  ubiquitous  aspects  of  the  underlying  complex 
networks,  such  as  the  appearance  of  non-stationary,  non-ergodic,  and  non-renewal  statistical  processes.  These  properties 
are  manifest  through  the  empirical  inverse  power-law  (IPL)  probability  density  functions  (pdf)  that  appear  in  physical,  social, 
biological,  economic,  and  physiological  networks.  Tools  from  non-equilibrium  statistical  physics  and  non-equilibrium  information 
theory  are  used  as  well.  To  understand  and  facilitate  network  agility  and  adaptability,  new  techniques,  methodologies,  and 
analyses  from  a  multi-disciplinary  approach  have  been  and  will  continue  to  be  combined  with  the  application  of  powerful 
modeling  and  simulation  to  explore  the  dynamics  of  networks.  High-fidelity  simulation,  emulation,  and  experiments  for  large- 
scale  dynamic  networks  —  based  on  complexity-performance  trades  and  incorporation  of  live  assets  in  an  emulation  environment 
—  will  provide  us  the  ability  to  understand  the  complicated  interactions  between  large-scale  heterogeneous  networks. 

3  Taxonomy  by  Discipline 

NS  is  unlike  traditional  scientific  disciplines  in  that  it  identifies  a  particular  structure,  that  of  a  network,  as  the  basis  for  the 
science.  Since  structural  webbing  exists  within  and  among  all  scientific  disciplines,  the  over-arching  goal  of  NS  is  to  develop 
a  body  of  principles  that  will  enable  modeling,  design,  analysis,  prediction,  and  control  of  the  behavior  of  the  underlying 
phenomena  in  the  physical  (sensors,  wireless  web),  information  (communications,  knowledge  management),  social  (people, 
organizations,  cultures),  and  cognitive  (perceptions,  beliefs,  decision-making)  sciences,  to  name  only  a  few.  The  emphasis  is 
not  restricted  to  the  network  structure  within  a  given  discipline,  but  is  focused  on  networked  phenomena  that  encompass  many 
disciplines  as  well  as  how  such  complex  networks  interact  with  one  another.  As  a  nascent  science,  the  underlying  principles 
are  still  missing,  so  the  grand  challenge  for  this  effort  is  to  create  and  organize  the  fundamental  knowledge  upon  which  to 
base  these  principles.  Moreover,  NS  is  intended  to  bridge  the  knowledge  gap  between  disciplines  and  breakdown  the  artificial 
barriers  that  have  been  created  between  disciplines. 

We  have  partitioned  the  scientific  papers  into  four  sections,  each  corresponding  to  a  different  category  of  disciplines:  the 
physical  sciences,  the  life  sciences,  the  social  sciences  and  the  information  sciences.  This  partitioning  is  intended  to  direct  the 
reader  to  that  area  of  investigation  with  which  s/he  is  most  comfortable.  On  the  other  hand,  this  opening  narrative  is  intended 
to  suggest  that  NS  is  more  than  just  a  thematic  thread  that  weaves  its  way  through  each  of  the  sections;  it  is  intended  as  an 
over-arching  perspective  that  encompasses  the  traditional  disciplines.  However,  in  the  rush  to  discuss  the  tangible  aspects  of 
science,  the  fieldable  products  of  technology,  and  the  immediate  needs  of  the  Army,  the  network  view  so  crucial  to  the  scientific 
enterprise  is  often  overlooked.  It  is  emphasized  here  because,  like  the  scientific  barrier  of  complexity,  it  often  represents  the 
problem  we  would  like  to  solve,  but  do  not  know  how. 

3.1  Physical  Sciences 

Physics  has  been  the  paradigm  of  science  since  the  time  of  Newton.  Consequently,  when  investigators  wanted  to  emphasize 
the  use  of  quantitative  methods  in  their  studies,  they  appended  the  suffix  physics.  In  the  19th  century  this  strategy  resulted  in 
sociology  giving  rise  to  sociophysics,  psychology  spawning  psychophysics  and  biology  producing  biophysics.  Of  course,  there 
were  also  a  number  of  physicists  speculating  on  the  use  of  their  techniques  to  understand  non-physical  phenomena.  As  a 
result,  new  disciplines  evolved  at  the  boundary  between  previously  established  disciplines.  This  way  of  extending  the  frontiers  of 
disciplinary  science  proceeded  into  the  20th  century,  producing  biochemistry,  bioengineering,  social  engineering,  econophysics, 
and  so  on.  But  it  was  the  successes  of  the  mathematical  methods  of  physics  that  attracted  the  most  attention. 

The  existence  of  an  empirical  law  or  principle  from  which  the  dynamics  of  a  physical  system  can  be  determined  appears  almost 
mystical  when  trying  to  understand  cognition  or  decision  making.  The  19th  century  physicists  and  applied  mathematicians  were 
able  to  take  the  conservation  of  energy  and  from  that  statement  determine  the  laws  of  motion  for  any  mechanical  system.  Such 
variational  calculations  in  the  hands  of  more  practical  people  subsequently  produced  control  theory  and  the  dynamics  of  non- 
mechanical  systems  consistent  with  a  set  of  imposed  constraints. 

Sardellitti  and  Barbarossa  of  Sapienza-University  of  Rome  collaborated  with  Swami  of  ARL  on,  "Optimal  Topology  Control 
and  Power  Allocation  for  Minimum  Energy  Consumption  in  Consensus  Networks"  (page  21),  to  design  a  network  topology 
that  optimizes  the  spread  of  information,  exemplified  by  the  computation  of  globally  relevant  statistics  by  exploiting  only  local 
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communication  among  the  sensors  in  a  wireless  network.  As  in  all  design  problems,  there  is  a  function  to  be  carried  out  by  the 
network  while  simultaneously  satisfying  a  given  constraint  such  as  maximizing  efficiency  or  minimizing  energy  consumption.  Is 
the  optimal  topology  sparse  or  dense?  The  research  of  Sardellitti  et  al.  was  concerned  with  minimizing  energy  use;  the  paper 
of  Hollingshad  et  al.,  discussed  later,  was  concerned  with  network  efficiency,  "A  New  Measure  of  Network  Efficiency"  (page 
13).  Sardellitti  et  al.  developed  a  method  for  optimizing  the  network  topology  and  the  power  allocation  across  every  link  in 
order  to  minimize  the  energy  necessary  to  achieve  consensus  across  the  sensor  network.  Since  the  topology  is  the  result  of 
optimization,  this  is  a  non-deterministic  polynomial  time  (NP)  combinatorial  problem  which  the  authors  circumvent  by  means 
of  a  clever  relaxation  procedure. 

Another  approach  to  handling  network  complexity  is  by  recognizing  that  the  elements  comprising  a  fighting  force  are  interlinked 
to  such  a  high  degree  that  changing  a  single  component  influences  and  modifies  the  response  of  every  other  component  in 
the  network.  Furthermore,  each  element  in  the  network  is  itself  a  network  and  must  be  treated  as  such.  This  observation  does 
not  in  itself  contribute  anything  new  to  the  discussion.  It  has  been  known  since  the  first  recorded  wars  that  small  changes  in 
tactics  and  strategy  can  lead  to  overwhelming  changes  in  outcome.  This  recognition  has  sometimes  led  to  inaction,  because  the 
outcome  of  any  action  is  unpredictable.  What  is  new  about  the  networking  concept  is  the  attempt  to  formalize  the  interactions 
so  as  to  make  the  response  to  change  predictable.  New  methods,  such  as  nonlinear  dynamics  theory  and  chaos,  have  enabled 
us  to  determine  the  boundaries  of  predictability  in  apparently  random  dynamic  processes,  so  we  can  realistically  assess  what 
we  can  and  cannot  know  about  a  particular  network.  But  it  all  goes  back  to  the  individual  Soldier,  situational  awareness,  and 
leadership  involving  the  synthesis  of  information  in  real  time. 

The  research  group  consisting  of  Hollingshad,  Turalska,  and  Grigolini  at  the  University  of  North  Texas;  Allegrini  from  the  University 
of  Pisa;  and  West  from  ARL  collaborated  on  constructing,  "A  New  Measure  of  Network  Efficiency"  (page  13).  They  address  the 
dynamical  origin  of  the  scale-free  probability  distribution  function  (pdf)  and,  consequently,  the  related  issue  of  network  topology. 
Here  again,  the  local  interactions  are  found  to  determine  the  global  properties  of  the  network,  using  a  nonlinear  mathematical 
model  with  nearest  neighbor  interactions  on  a  two-dimensional  lattice.  The  network  dynamics  undergo  a  phase  transition  at 
a  critical  value  of  the  control  parameter,  thereby  inducing  long-range  correlations  among  the  members  of  the  network.  The 
researchers  determine  that  the  distribution  of  links  favoring  long-range  connections  increases  the  efficiency  of  information 
transfer,  which  becomes  optimally  efficient  when  the  IPL  index  for  the  degree  pdf  is  one.  They  attribute  this  to  the  role  of  network 
leadership  being  transient,  with  the  rank  differences  between  the  most-linked  and  least-linked  nodes  being  temporary  and  over 
time  the  highest  rank  is  shared  by  all  members  of  the  network.  The  full  implications  of  this  insight  remain  to  be  explored. 

A  fundamental  difficulty  with  the  concept  of  networking  networks,  such  as  nature  has  done  within  the  human  body  or  as  applied 
to  material  structures,  is  related  to  the  multiple  scale  and  multi-physics  requirements,  as  seen,  for  example,  in  electronic 
networks.  If  all  physics  were  known  and  we  had  the  necessary  mathematical  tools,  we  could  model  the  ultimate  performance 
of  any  physical  network  given  the  atomic  composition  of  the  parts.  Quantum  physics  would  enable  atomic  and  nano-scale 
properties  to  be  captured  and  parameterized  yielding  the  electronic,  thermal,  and  mechanical  attributes  of  materials.  From  this 
ability  to  effectively  couple  across  scales,  we  could  eventually  determine  the  capabilities  and  performance  of  multifunctional 
networks  interrelating  sensor,  information,  and  communication  capabilities.  We  could  do  this  without  fabricating  network 
components,  and  we  could  achieve  optimization  at  a  level  of  sophistication  that  is  beyond  our  wildest  dreams  using  current 
technology  and  network  development.  On  the  other  hand,  it  seems  unreasonable  to  model  every  atom  and  electron  in  a  physical 
network,  yet  the  atomic  level  properties  affect  the  composite  performance  and,  therefore,  cannot  be  neglected.  At  criticality, 
the  smallest  fluctuations  are  often  amplified  to  the  macroscopic  level  and,  therefore,  contain  information  on  the  nature  of  the 
large-scale  network. 

Ren  and  Zhao  of  the  University  of  California,  Davis  have  collaborated  with  Swami  of  ARL  on,  "Connectivity  of  Heterogeneous 
Wireless  Networks"  (page  53).  This  research  results  in  a  better  understanding  of  the  spatial  connectivity  of  one  heterogeneous 
network  (secondary)  embedded  within  another  (primary);  the  heterogeneity  is  modeled  using  Poisson  statistics  for  the  ad  hoc 
network.  The  analysis  is  done  using  percolation  theory,  which  was  initiated  in  statistical  physics  and  has  since  developed  into 
its  own  branch  of  mathematics.  Percolation  theory  is  a  natural  method  for  answering  the  question  of  whether  the  secondary 
network  can  be  connected,  i.e.,  when  can  a  message  be  transmitted  across  the  entire  network  without  disruption?  The 
researchers  establish  that  when  the  density  of  the  primary  transmitters  is  beyond  a  certain  level,  there  are  simply  not  enough 
spectrum  opportunities  for  any  secondary  network  to  be  connected.  The  design  parameters  are  the  interference  tolerances 
of  the  primary  and  secondary  Poisson  distributed  networks  and  are  used  to  design  the  optimal  transmission  power  of  the 
secondary  network  based  on  that  of  the  primary.  Consequently,  the  connectivity  of  large-scale  ad  hoc  heterogeneous  wireless 
networks  are  expressed  in  terms  of  the  occurrence  of  the  percolation  phenomenon.  It  turns  out  that  matching  the  interference 
ranges  of  the  primary  and  secondary  networks  maximizes  the  tolerance  of  the  secondary  network  to  the  primary  traffic  load. 
Since  the  tolerable  interference  range  is  an  indicator  of  the  receiver's  sophistication,  this  can  be  interpreted  as  an  instance  of 
the  complexity  matching  principle  discussed  in,  "Transmission  of  Information  Between  Complex  Systems:  l/f  Resonance"  (page 
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233),  and  "Temporal  Complexity  of  the  Order  Parameter  at  the  Phase  Transition"  (page  213  ). 

A  systematic  procedure  based  on  multiple  models  at  different  scales  with  representations  at  adjacent  scales  linked  to  each 
other  by  defined  parameterized  interfaces  that  can  be  calibrated  to  measurement  is  one  reasonable  approach  to  multi-scale 
modeling.  This  is  done  today  through  a  combination  of  deterministic  and  stochastic  modeling,  where  the  smaller  scales  give  rise 
to  statistical  uncertainty  at  the  larger  scales.  Thus  multi-scale  modeling  involves  multiple  organizations  and  disciplines  leading 
to  multiple,  seemingly  incompatible  approaches.  One  school  of  thought  believes  it  essential  that  an  object-oriented  approach 
be  followed,  meaning  in  this  context  that  there  are  well  defined  interfaces  to  models  capturing  a  particular  scale  and  that  the 
various  scales  are  captured,  most  likely,  by  stand-alone  programs.  This  multi-scale  modeling  is  a  grand  scientific/engineering 
challenge  for  Army  research. 

Metz  at  the  University  of  Ulm;  Mitasova  at  North  Carolina  State  University;  and  Harmon  at  ARL  collaborated  on,  "Efficient 
Extraction  of  Drainage  Networks  from  Massive,  Radar-Based  Elevation  Models  with  Least  Cost  Path  Search"  (page  39).  They 
use  a  least-cost  drainage  path  (LCP)  search  methodology  to  significantly  improve  the  accuracy  of  stream  mapping,  especially  in 
remote,  hard  to  reach  regions  and  those  that  can  be  mapped  with  massive  data  sets.  The  LCP  search  methods  were  designed 
to  find  the  shortest  or  fastest  route  from  a  starting  point  to  a  given  destination.  The  success  in  extracting  reliable  drainage 
networks  from  massive  radar-based  data  sets  indicates  that  nature  must  optimize  a  cost  function  over  geophysical  time  scales 
in  direct  analogy  to  the  variation  in  total  energy  determining  the  dynamics  of  mechanical  systems.  Here  the  cost  is  interpreted 
in  terms  of  local  elevation  and  the  least-cost  path  is  minimal  elevation  contour. 

3.2  Life  Sciences 

Complex  networks  in  the  life  sciences  generate  patterns  that  have  been  summarized  in  a  number  of  empirical  relations.  In 
physics  we  have  Ohm's  law,  the  perfect  gas  law,  and  other  empirical  relations  among  the  variables  describing  the  physical 
process.  These  "laws"  are,  of  course,  phenomenological  rules  developed  to  summarize  what  has  been  observed  experimentally 
and  that  were  eventually  derived  using  fundamental  theory.  In  more  complex  living  networks,  equally  simple  rules  have  been 
uncovered  that  typically  relate  a  particular  function  to  the  size  of  an  organism,  as  first  observed  in  biology  some  two  hundred 
years  ago.  The  first  such  expression  related  the  mass  of  an  animal's  brain  to  its  total  body  mass,  using  mass  as  a  measure  of 
size  and  giving  rise  to  the  nomenclature  of  allometry  relation  (AR).  Allometry,  literally  meaning  by  a  different  measure,  has  been 
defined  as  the  study  of  body  size  and  its  consequences  both  within  a  given  organism  and  between  species  in  a  given  taxon. 

The  history  of  ARs  is  presented  by  West  at  Renesselaer  Polytechnique  Institute  and  West  at  ARL,  who  critiqued  the  various 
attempts  to  construct  the  theory  underlying  the  scaling  observed  in  ARs  in,  "On  Allometry  Relations"  (page  115).  ARs  are 
critically  important  because  they  set  the  boundary  conditions  that  network  models  must  satisfy  in  order  to  be  compatible  with 
existing  data.  For  example,  the  fractal  network  nutrient  transport  model  used  to  successfully  derive  the  metabolic  AR  is  one  of 
the  network  applications  reviewed. 

Another  emerging  application  of  network  theory  is  in  analyzing  interactions  and  relationships  in  groups  of  animals  and/or 
humans.  The  network  science  methodologyallowsfor  better  understandingof  the  complexity  of  interactions  and  communications 
between  individuals  in  groups.  The  review  by  Pollard,  formerly  at  the  University  of  California,  Los  Angeles  and  now  with  ARL, 
and  Blumstein  at  ARL,  "Evolving  Communicative  Complexity:  Insights  from  Rodents  and  Beyond"  (page  87),  discusses  the 
relationship  between  social  interactions  and  communication  patterns,  and  how  focused  studies  of  communicative  complexity 
can  better  untangle  the  underlying  evolutionary  relationships.  The  functional  relationship  between  attributes  of  social  complexity 
and  attributes  of  communicative  complexity  allow  for  a  better  understanding  of  how  socialization  and  communication  evolve. 

The  importance  of  interactions  in  neural  networks  for  understanding  of  the  operation  of  the  brain  has  been  recognized  for  a 
longtime.  However,  only  recently  have  advances  in  theoretical,  experimental,  and  computational  approaches  to  the  study  of 
neural  networks  begun  to  reveal  fundamental  principles  of  brain  structure  and  function.  There  is  emerging  consensus  among 
neuroscientists  that  traditional  approaches  concentrating  on  identifying  brain  structures  responsible  for  specific  functions  and 
behavior  need  to  be  expanded  to  include  investigations  of  interactions  between  these  structures.  New  brain  imaging  techniques 
allow  for  more  precise  identification  of  neural  tracts  connecting  different  regions  of  the  brain,  providing  information  necessary 
for  creation  of  the  structural  connectome  network  of  an  individual  brain.  This  allows  for  analyzing  and  interpreting  characteristic 
features  in  different  individuals  as  well  as  identifying  abnormalities  caused  by  neurotrauma.  The  article  by  Kraft,  Mckee,  and 
Dagro  at  ARL  in  collaboration  with  Grafton  at  the  University  of  California,  Santa  Barbara,  "Combining  the  Finite  Element  Method 
with  Structural  Connectome-Based  Analysis  for  Modeling  Neurotrauma:  Connectome  Neurotrauma  Mechanics"  (page  99), 
presents  a  novel  approach  to  analysis  and  interpretation  of  structural  connectome  obtained  from  MRI  images  by  constructing  a 
finite  element  model  (FEM)  informed  by  experimental  data  at  a  cellular  level.  This  approach  integrates  brain  injury  biomechanics 
and  graph  theoretical  analysis  of  neuronal  connections  to  form  a  computational  model  capturing  spatiotemporal  characteristics 
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of  trauma.  Time  evolving  simulations  of  the  trauma  investigated  in  this  study  show  how  brain  network  measures  of  global 
and  local  efficiency  are  degrading  in  time  after  trauma.  This  approach  provides  a  basis  for  quantitative  analysis  tools  for 
interpretation  of  neuroimaging  data.  Computational  models  based  on  theoretical  analysis  of  neural  network  connections  could 
help  inform  subsequent  analysis  of  functional  networks. 

Neural  network  models  based  on  neurophysiological  properties  of  neurons  provide  insight  into  functional  relationships  between 
different  brain  structures.  In  recent  years,  due  to  increased  computational  capabilities,  it  has  become  possible  to  model  larger 
networks  composed  of  more  realistic  neurons.  This  allows  for  more  realistic  simulations  of  normal  and/or  pathological  brain 
function,  which  can  be  directly  compared  with  experimental  or  even  clinical  measurements.  The  article  by  Anderson,  Azhar, 
Kudela,  and  Bergey  at  Johns  Hopkins  University  in  collaboration  with  Franaszczuk  at  ARL,  "Epileptic  Seizures  from  Abnormal 
Networks:  Why  Some  Seizures  Defy  Predictability"  (page  73),  illustrates  how  this  modeling  approach  can  provide  some  insights 
into  observed  but  difficult  to  interpret  phenomena  within  the  human  brain.  Epileptic  seizures  are  the  pathological  activity  of 
the  brain  disrupting  the  normal  activity  of  an  afflicted  subject.  While  a  seizure  usually  lasts  only  for  a  relatively  short  period 
of  time  (1-2  minutes),  its  unpredictability  causes  significant  disruption  in  the  life  of  the  subject  and  may  even  cause  serious 
injury  due  to  associated  loss  of  consciousness  and  body  control.  There  is  limited  opportunity  to  record  data  associated  with 
epileptic  seizures.  Even  during  hospitalization,  patients  may  have  as  few  as  one  or  two  seizures  during  an  entire  week.  The 
computationally  realistic  modeling  allows  for  investigation  of  multiple  simulated  seizures  with  full  control  of  critical  parameters. 
In  Anderson  et  al.,  the  authors  created  a  computational  model  of  the  cortex  consisting  of  more  than  60,000  spiking  neurons 
of  several  types  based  on  histological  data.  Analysis  of  simulations  of  neuronal  activity  in  this  model  suggest  that,  in  the  region 
of  the  cortex,  with  abnormal  connectivity  analogous  to  seizure  focus  in  the  human  brain,  it  is  possible  to  initiate  seizure  activity 
with  random  fluctuations  of  input  from  the  surrounding  cortical  regions.  This  mechanism  helps  explain  the  difficulty  in  predicting 
partial  seizures  in  some  patients  and  suggests  that  intervention  not  only  in  the  seizure  focus  area,  but  also  in  surrounding 
areas,  may  be  necessary  to  prevent  seizures  from  occurring. 

The  continuing  research  at  ARL  is  directed  toward  combining  the  structural  modeling  of  brain  networks,  "Combining  the  Finite 
Element  Method  with  Structural  Connectome-Based  Analysis  for  Modeling  Neurotrauma:  Connectome  Neurotrauma  Mechanics" 
(page  99),  with  neurophysiological  models  similar  to  those  presented  in  "Epileptic  Seizures  from  Abnormal  Networks:  Why  Some 
Seizures  Defy  Predictability"  (page  73)  to  create  a  more  complete  model  of  brain  neural  networks. 

The  diverse  applications  of  network-based  or  inspired  studies  in  life  sciences  presented  here  show  that  the  network  sciences 
concepts  and  methodology  provide  a  useful  conceptual  platform  for  description  and  analysis  of  various  phenomena  and 
processes.  However,  we  are  in  the  early  stages  of  applying  NS  concepts  to  the  various  life  sciences.  Development  of  more 
specific  mathematical  and  computational  tools  is  required,  but  the  future  applications  are  limitless. 

3.3  Social  Sciences 

In  tactical  networks,  distributed  decision  making  should  take  into  account  trust  in  the  elements:  the  sources  of  information,  the 
processors  of  information,  the  elements  of  the  communications  network  (both  radio  and  other)  across  which  the  information 
is  transmitted,  etc.  This  trust  must  often  be  derived  under  time-critical  conditions,  and  in  a  distributed  way.  It  must  account  for 
the  uncertain  provenance  of  the  data,  including  the  fact  that  information  sources  and  relays  in  a  tactical  network  are  subject 
to  subversion.  Trust  is  a  critical  element  of  networked  interactions,  including  interactions  between  humans  and  agents,  that 
may  be  direct  or  computer-mediated.  Trust  and  reputation  are  becoming  increasingly  important  due  to  the  explosive  growth 
in  electronically  mediated  social  interactions,  often  with  sometimes  correspondingly  weakened  social  control.  Trust  is  a  multi- 
faceted  concept  and  depends  upon  the  context,  the  trustor,  the  environment,  the  infrastructure,  and  the  mediating  agents  or 
relays.  Trust  is  a  critical  element  of  modeling  team,  group,  squad,  and  company  behaviors.  It  must  be  taken  into  account  in 
promoting  group  cohesivenessand  effective  problem-solving,  and  it  must  be  adequately  modeled  in  quantifying  the  tension  and 
trade-off  between  individual  utility  and  group  welfare.  Technology  can  enhance  as  well  as  weaken  trust  relationships.  Currently 
used  models  of  trust  do  not  adequately  capture  critical  elements  of  human  trust  (such  as  lack  of  transitivity,  symmetry,  and 
reciprocity,  which  calls  for  novel  mathematical  tools).  Managing  trust  in  such  an  environment  is  challenging.  Cho  and  Swami 
from  ARL  collaborated  with  Chen  from  Virginia  Tech  on  the  paper,  "A  Survey  of  Trust  Management  in  Mobile  Ad  Hoc  Networks" 
(page  189),  which  addresses  some  of  these  challenges.  It  discusses  the  concepts  and  properties  of  trust  pertinent  to  the 
constituent  elements  of  a  tactical  network  and  proposes  composite  trust  metrics  that  take  into  account  the  interaction  between 
the  constituent  networks,  the  resource  constraints,  and  the  mission  goals.  This  comprehensive  survey  discusses  the  differences 
between  trust  and  trustworthiness,  trust  and  risk,  and  trust  management  schemes. 

How  can  one  maximize  rewards  in  an  unknown  environment?  Learning  is  clearly  a  crucial  element  here.  There  are  two  important 
performance  criteria  to  assess  the  efficacy  of  learning:  consistency  and  regret  bounds.  A  learning  algorithm  is  consistent  if  the 
learned  estimates  asymptotically  converge  (in  some  sense)  to  the  true  values.  Regret  is  a  measure  of  the  speed  of  convergence. 
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the  time-averaged  loss  in  performance  compared  with  that  obtained  by  a  genie.  Anandkumar  from  the  University  of  California, 
Irvine,  and  Michael  and  Tang  from  Cornell  University  collaborated  with  Swami  from  ARLon  aspects  of  this  problem  in  the  paper, 
"Distributed  Algorithms  for  Learning  and  Cognitive  Medium  Access  with  Logarithmic  Regret"  (page  173).  The  scenario  they 
considered  is  related  to  the  problem  studied  in  "Connectivity  of  Heterogeneous  Wireless  Networks"  (page  53).  They  consider  the 
problem  of  so-called  secondary  users  learning  the  unknown  rewards  (perhaps  rates)  obtained  by  accessing  a  set  of  channels. 
They  consider  a  distributed  framework  where  there  is  no  information  exchange  or  prior  agreement  among  the  secondary  users. 
This  introduces  additional  challenges:  First,  one  can  expect  loss  of  performance  due  to  collisions  among  users.  Second,  there 
is  competition  since  all  the  secondary  users  are  vying  to  access  channels  with  the  highest  rewards.  Using  the  theory  of  multi- 
armed  bandits,  they  analyze  the  trade-off  between  exploration  and  exploitation  and  between  cooperation  and  competition.  They 
propose  an  order  optimal  distributed  learning  and  access  policy  that  minimizes  the  regret.  An  important  notion  here  is  that 
one  must  learn  from  mistakes  (collisions  and  sensing  that  may  be  imperfect)  and  that  one  can  never  stop  learning  (one  must 
continue  to  explore). 

Cooperative  interactions  can  lead  to  phase  transitions  and  the  emergence  of  spatial  and  network  complexity  in  many  systems. 
The  Ising  model  has  been  used  to  model  phase  transitions  in  a  variety  of  phenomena,  including  ferromagnetism,  biological 
and  neurophysiological  processes,  the  flocking  of  birds,  econophysics,  etc.  But  the  temporal  complexity  of  such  networks  had 
not  been  explored  until  recently.  Turalska  and  Grigolini  from  the  University  of  North  Texas  collaborated  with  West  of  ARL  on 
the  paper,  "Temporal  Complexity  of  the  Order  Parameter  at  the  Phase  Transition"  (page  213),  which  explores  the  temporal 
complexity  issue.  They  study  a  decision  making  model  in  which  the  time  intervals  between  significant  events  has  a  power-law 
distribution,  which  they  show  is  equivalent  to  a  two-dimensional  Ising  model.  The  underlying  process  is  a  renewal  process, 
but  it  is  non-stationary  and  exhibits  long-term  correlations.  Thus  the  fundamental  property  of  ergodicity  no  longer  holds,  and 
new  mathematical  tools  are  required  to  analyze  these  non-Poisson  renewal  processes.  The  researchers  establish  that  phase 
transition  leads  to  temporal  complexity  as  manifested  by  non-stationary  and  non-ergodic  fluctuations.  They  argue  that  this 
is  a  basis  for  complexity  matching,  a  notion  elaborated  on  in,  "Transmission  of  Information  Between  Complex  Systems:  1/f 
Resonance"  (page  233).  The  pair  of  papers,  "Temporal  Complexity  of  the  Order  Parameter  at  the  Phase  Transition"  (page  213) 
and  "Transmission  of  Information  Between  Complex  Systems:  l/f  Resonance"  (page  233)  indicate  that  information  can  be 
transferred  between  networks  via  extremely  weak  coupling,  provided  that  the  complexity  of  the  two  networks  is  matched  (in 
the  sense  that  a  form  of  resonance  exists).  The  concepts  of  complexity  management  and  complexity  matching  are  likely  to  find 
wide  applicability  in  NS. 

3.4  Information  Sciences 

In  the  social  and  life  sciences,  network  discovery  has  led  to  increased  understanding  of  how  the  brain  works  and  how 
information  propagates  in  communities  of  people.  In  the  information  sciences,  one  wishes  to  create  a  physical  network  that 
enables  communication  between  entities,  either  person-to-person,  person-to-machine,  or  machine-to-machine.  Design  of  the 
physical  network  must  account  for  the  manner  and  the  environment  in  which  the  network  will  be  used. 

In  most  non-military  situations,  the  backbone  of  this  network  is  fixed,  for  example,  through  cell  towers.  Senders  and  receivers 
are  tethered  to  the  backbone,  that  is,  fixed  spatially  and  hard-wired,  or  untethered,  physically  mobile,  and  wirelessly  connected. 
However,  in  the  environments  in  which  the  military  operates,  designers  cannot  expect  a  wireless  infrastructure  to  be  available  for 
use.  Thus,  military  units  must  be  capable  of  establishing  and  maintaining  their  own  network,  which  is  a  challenge  during  mobile 
operations.  It  is  even  more  of  a  challenge  when  nodes  on  the  network  are  small,  autonomously  mobile  platforms  operating  on 
city  streets  and  inside  buildings,  where  floor  plans  and  building  materials  affect  the  viability  of  links  between  nodes. 

The  use  of  multiple  autonomous  platforms  is  inspired  by  bionetworks  in  nature  such  as  insect  swarms.  Although  each  single 
entity  in  a  swarm  has  limited  intelligence,  the  swarm  appears  to  behave  with  intent  when  the  entities  function  in  concert. 
Consider  the  behavior  of  social  insects,  such  as  bees  and  ants.  Fish  and  birds  are  other  examples  of  animals  whose  collective 
behavior  belies  their  limited  intelligence.  Although  this  analogy  is  intellectually  appealing,  differences  exist  between  bionetworks 
and  networks  of  autonomous  mobile  platforms  that  make  it  difficult  to  implement.  Bionetworks,  for  example,  swarm  for  survival, 
either  to  seek  food  or  provide  protection.  In  our  grandest  vision,  swarms  of  autonomous  platforms  will  exhibit  complex  behavior, 
such  as  intelligence  or  sentience  through  "situational  awareness."  Further,  communication  within  animal  swarms  occurs 
through  proximity,  e.g.,  ants  touching  antennae  or  following  a  chemical  trail,  line-of-sight  vision,  or  sound.  Put  simply,  biological 
swarms  communicate  simple  messages  primitively  to  perform  primitive  behaviors,  whereas  our  vision  of  autonomous  platforms 
requires  complex  messaging  to  achieve  complex  behaviors. 

Given  the  present  state  of  technology,  a  better  analogy  may  be  a  shepherd  herding  sheep  using  a  sheep  dog.  That  is,  a  single 
intelligent  entity  controls  a  layered  hierarchical  network  wherein  each  layer  contains  more  entities  of  lesser  intelligence  than 
the  layer  above  it.  Whether  a  swarm  or  a  pack  is  the  better  analogy,  in  any  case,  researchers  have  only  scratched  the  surface 
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of  issues  concerned  with  designing  a  mobile  network  created  by  a  collective  of  autonomous  platforms.  Controlling  the  mobility 
of  a  pack  of  autonomous  platforms  and  communicating  within  it  remain  unsolved  problems. 

How  platforms  move  to  achieve  their  mission,  for  example,  generating  the  floor  plan  of  a  building  previously  unexplored  by  US 
units,  yet  maintain  connectivity  is  the  focus  of  the  paper,  "Robust  Control  for  Mobility  and  Wireless  Communication  in  Cyber- 
Physical  Systems  with  Application  to  Robot  Teams,"  by  ARL  researcher  Fink  and  colleagues  Riberio  and  Kumar  at  the  University 
of  Pennsylvania.  The  key  element  of  the  paper,  "Robust  Control  and  Mobility  and  Wireless  Communication  in  Cyber-Physical 
Systems  With  Application  to  Robot  Teams"  (page  247),  is  the  authors'  statement  that  "successful  deployment  of  an  autonomous 
team  of  robots  requires  joint  cyber-physical  controllers  that  determine  (physical)  trajectories  for  the  robots  while  ensuring 
(cyber)  availability  of  communication  resources."  The  authors'  approach  to  solving  this  problem  is  to  consider  communication 
parameters  (bandwidth,  attenuation,  noise)  as  virtual  obstacles  that  restrict  platform  movement.  However,  in  an  iterative 
control  cycle,  after  platforms  move,  the  network  readjusts  communications  parameters  to  provide  reliable  communication  (e.g., 
establishes  new  links,  removes  old  ones,  increases  transmitted  power,  or  reduces  transmission  bandwidth). 

The  basis  for  control  is  minimizing  a  potential  function  subject  to  the  constraint  that  data  transmission  in  every  established  link 
is  above  some  threshold  data  rate.  The  strength  of  this  approach  is  its  global  perspective  of  all  links.  Previous  methods  consider 
mobility  from  the  perspective  of  local  links  only.  Simulations  and  experiments  with  five  mobile  ground  platforms  validate  the 
viability  of  this  approach. 

Analogies  to  nature  inform  the  design  of  these  experiments.  In  addition,  they  provide  a  benchmark  against  which  one  can 
measure  performance.  However,  the  experiments  provide  a  perspective  for  reinterpreting  and  understanding  nature.  Does 
the  collection  of  individual  movement  and  communication  minimize  a  global  potential?  If  so,  what  potential?  The  total  energy 
expended  by  the  network  to  achieve  goal?  How  does  the  mode  of  communication  impact  an  individual's  movement  if  the 
communication  is  non-line-of-sight  versus  line-of-sight?  The  answers  to  these  questions  fall  back  into  the  realm  of  network 
science  theory. 

Designers  of  the  physical  networks  that  enable  the  transfer  of  electronic  information  need  to  consider  means  for  simple,  rapid, 
and  robust  data  transfer.  In  applications  where  the  network  is  stable  and  network  operators  have  considerable  opportunity  to 
monitor  and  control  the  network,  for  example,  commercial  networks,  message  traffic  can  be  handled  sequentially.  However,  in 
complex,  less  stable,  environments,  parallel  transmission  of  messages  provides  more  robust  operation.  In  this  case,  messages 
between  two  channels  are  transmitted  contemporaneously  to  a  relay.  The  relay  combines  the  messages  and  retransmits  the 
same  composite  message  to  both  channels.  Since  each  channel  knows  the  signal  they  transmitted,  it  is  a  simple  matter  for 
them  to  extract  the  unknown  message  from  the  other  channel. 

However,  parallel  messaging  dictates  that  messages  between  channels  must  arrive  within  certain  time  blocks,  which  requires 
timing  control  to  minimize  transmission  delays.  As  the  likelihood  of  large  delays  increases,  the  likelihood  of  missed  messages 
also  increases.  Although  coherent  detection,  in  which  all  channels  operate  on  the  same  time  base,  minimizes  timing  delays, 
it  requires  precise  control  of  phase.  Such  control  is  unlikely  in  unstable  environments.  In  these  situations,  designers  trade  the 
precision  of  coherent  detection  for  the  simplicity  of  non-coherent  detection,  which  does  not  require  a  frequency  or  phase  base. 
In  their  implementation  of  a  non-coherent  system,  Valenti  and  Ferrett  of  West  Virginia  University  and  Torrieri  of  ARL,  "Non- 
coherent Physical-Layer  Network  Coding  with  FSL  Modulation:  Relay  Receiver  Design  Issues"  (page  221),  use  frequency  shift 
keying  (FSK)  to  encode  signals.  In  FSK,  the  network  needs  to  detect  only  changes  in  frequency,  not  measure  absolute  frequency, 
to  demodulate  signals.  Because  the  encoding  does  not  require  a  reference  frequency,  it  reduces  network  requirements  on 
frequency  control  and  on  power  control. 

In  this  paper,  Valenti,  Torrieri  and  Ferrett  present  the  theory,  design,  and  simulated  operation  of  FSK  for  non-coherent  physical- 
layer  network  coding.  They  show  that  the  encoding  provides  high  throughput  with  low  energy  efficiency,  which  is  typical  of  non- 
coherent systems.  To  overcome  power  losses,  they  employ  low  rate  turbo  codes.  The  authors  show  that,  for  the  same  energy 
efficiency,  their  digital  network  coding  (DNC)  approach  achieves  a  higher  throughput  rate  in  comparison  to  more  traditional 
link-layer  network  coding  (LNC).  In  one  example,  the  authors  show  a  one-third  rate  increase.  That  is,  four  messages  can  be  sent 
using  the  authors'  approach  for  every  three  messages  transmitted  using  LNC. 

Aquino  of  the  Imperial  College  London,  Bologna  of  the  Instituo  de  Alta  Investigacion  Chile,  Grigolini  of  the  University  of  North 
Texas,  and  West  of  ARL  investigated  the  transport  of  information  between  two  complex  networks  to  determine  the  conditions 
under  which  the  information  transport  is  maximized  in,  "Transmission  of  Information  Between  Complex  Systems:  l/f  Resonance" 
(page  233).  If  network  theory  is  ever  to  become  Network  Science,  there  must  exist  general  principles  that  transcend  the  level  of 
disciplinary  mechanisms  and  that  remain  valid  across  the  physical,  biological,  and  social  phenomena  described  above.  In  this 
paper,  Aquino  et  al.  use  arguments  from  non-equilibrium  statistical  physics  to  make  the  case  that  if  the  complexity  of  a  network 
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can  be  quantified,  then  the  information  transferred  is  maximized  through  the  matching  of  the  complexity  of  the  interacting 
networks. 

The  complexity  of  networks,  from  avalanches  of  neuronal  firings  within  the  human  brain  to  the  blinking  times  in  colloidal 
quantum  dot  fluorescence,  is  manifest  in  IPL  distributions.  The  level  of  complexity  of  these  networks  is  quantified  by  the  value 
of  the  IPL  index.  Of  particular  importance  is  the  manner  in  which  information  is  shuttled  back  and  forth  between  such  networks 
and  whether  or  not  there  exists  a  general  principle  that  guides  the  flow  of  information.  The  mathematician  Norbert  Wiener 
speculated  that  such  a  principle  existed  for  information-dominated  processes.  A  half  century  later,  Aquino  et  al.  in,  "Transmission 
of  Information  Between  Complex  Systems:  l/f  Resonance"  (page  233),  proved  Wiener's  conjecture  to  be  true  by  establishing 
the  Principle  of  Complexity  Management  in  which  information  transport  is  mitigated  by  the  relative  values  of  the  networks'  IPL 
indices.  They  establish  the  principle  by  generalizing  the  fluctuation-dissipation  theorem  from  non-equilibrium  statistical  physics 
to  non-ergodic  processes.  In  this  way,  it  is  possible  for  a  network  with  lesser  energy,  but  greater  information,  to  organize  a 
network  with  greater  energy,  but  lesser  information;  this  is  an  information-dominated  process. 

On  one  hand,  a  speaker  who  delivers  a  lecture  standing  motionless  in  front  of  an  audience,  talking  in  a  monotone,  transfers 
little  or  no  information.  The  group  very  quickly  drifts  off  and  stops  listening  almost  independently  of  the  content  of  the  lecture. 
On  the  other  hand,  another  speaker  with  a  modulated  voice,  striding  back  and  forth  across  the  front  the  room  while  gesturing 
dramatically,  can  hold  the  audience's  attention.  The  complexity  of  the  presentation  matches  that  of  the  typical  cognitive  activity 
of  the  brain  and  information  finds  its  way  into  the  consciousness  of  the  audience  member.  The  success  of  the  second  speaker, 
as  well  as  a  myriad  of  other  social  and  psychological  phenomena,  are  explained  by  the  Principle  of  Complexity  Management  and 
the  relative  complexity  of  the  two  interacting  networks. 

The  following  collection  of  recently  published  journal  articles  is  a  select  sample  of  the  broad  spectrum  of  ARL's  network  science 
research  investigating  physical,  social/cognitive,  information,  and  communication  networks,  and  their  complex  interactions. 
They  represent  foundational  knowledge  that  will  enable  optimized  human  performance  and  enhanced  speed  and  precision  in 
complex  networked  environments. 
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We  address  the  issue  of  the  dynamical  origin  of  scale-free  link  distributions.  We  study 
a  two-dimensional  lattice  of  cooperatively  interacting  units.  Although  the  units  interact 
only  with  the  four  nearest  neighbors,  a  sufficiently  large  cooperation  strength  generates 
dynamically  a  scale-free  network  with  the  power  law  index  v  approaching  1.  We  explain 
this  result  by  using  a  new  definition  of  network  efficiency  determined  by  the  Euclidean 
distance  between  correlated  units.  According  to  this  definition  the  link  distribution 
favoring  long-range  connections  makes  efficiency  increase.  We  embed  an  ad  hoc  scale-free 
network  with  power  index  v  >  1  into  a  Euclidean  two-dimensional  space  and  show  that 
the  network  efficiency  becomes  maximal  as  v  approaches  1.  We  therefore  conclude  that 
V  =  \  emerging  from  the  cooperative  interaction  of  units  may  be  a  consequence  of  the 
principle  of  network  maximal  efficiency. 
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1.  Introduction 

The  work  of  Albert  and  Barabasi  (AB)  [1]  prompted  interest  in  scale-free  networks,  namely  networks  with  the  inverse 
power  law  distribution  of  links 

P(k)  a  1  (1) 

with  k  denoting  the  number  of  links  per  node.  The  AB  model  assumed  the  preferential  attachment  of  the  Yule  process  and 
gave  rise  to  the  power-law  index  v  =  3.  The  ubiquity  of  these  networks  has  led  to  the  research  into  the  dynamical  origin 
of  the  scale-free  property.  Fraiman  et  al.  [2]  has  recently  studied  the  two-dimensional  Ising  model  at  criticality  and  using 
a  suitable  threshold  p+  established  a  link  between  the  nodes  with  a  dynamical  correlation  exceeding  that  threshold.  The 
interesting  result  of  that  procedure  was  the  emergence  of  the  scale-free  distribution  density  of  Eq.  (1)  with  a  power  index 
V  that,  with  the  proper  choice  of  average  connectivity  {/<),  gives  values  close  to  v  =  2  [3].  It  is  remarkable  that  the  same 
procedure,  applied  to  the  brain,  yields  results  very  close  to  those  afforded  by  the  Ising  model  [2]. 

Like  the  work  of  Ref.  [2],  we  begin  with  a  regular  two-dimensional  network,  but  then  take  an  entirely  different  approach. 
Rather  than  using  the  Ising  model,  we  adopt  the  Decision  Making  (DM)  model  of  Refs.  [4,5  ]  for  the  creation  of  the  dynamically 
induced  network.  The  key  result  is  that  while  we  also  find  the  emergence  of  a  scale-free  network  at  criticality,  our  network 
has  an  exponent  v  close  to  1. 
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This  finding  suggests  that  the  cooperative  interaction  between  the  units  in  the  network  generated  from  the  DM  model 
should  make  the  network  very  efficient.  However,  we  find  herein  that  the  conventional  measures  of  network  efficiency  [6] 
are  relatively  insensitive  to  v  <  2.  The  second  result  of  this  paper  is  a  new  measure  of  network  efficiency,  the  perception 
length,  that  overcomes  this  limitation,  and  shows  that  network  efficiency  increases  as  v  approaches  1. 

The  outline  of  this  paper  is  as  follows.  We  devote  Section  2  to  proving  the  dynamical  emergence  of  the  scale-free  network 
with  V  =  \.  Section  3  shows  that  v  approaching  1  maximizes  the  perception  length  while  realizing  a  statistical  condition 
equivalent  to  the  dynamically  generated  network.  We  devote  Section  4  to  concluding  remarks. 

2.  Dynamically  induced  scale-free  network  with  v  =  1 

The  question  of  whether  the  network  emerging  from  dynamical  correlations  is  scale-free  is  not  trivial.  For  this  paper,  we 
adopt  the  Decision  Making  (DM)  model  of  Refs.  [4,5].  Although  the  DM  model  yields  a  phase  transition  very  similar  to  that 
of  the  conventional  Ising  model  [7],  the  DM  model  does  not  have  a  Hamiltonian  foundation.  Furthermore,  while  the  Ising 
model  rests  on  the  action  of  a  thermal  bath  at  a  finite  temperature,  and  the  single  spins  in  isolation  and  with  no  thermal 
driving  would  be  dynamically  frozen,  the  units  of  the  DM  model,  in  isolation,  are  driven  by  Poisson  dynamics.  This  difference 
may  lead  to  a  dynamically  created  scale-free  network  different  from  that  of  Ref.  [2]. 

The  DM  model  considers  N  discrete  variables  located  at  the  nodes  of  a  two-dimensional  square  lattice.  Each  unit  S/  is  a 
stochastic  oscillator  and  can  be  found  in  either  of  two  states,  "y^s"  (+1)  or  "no"  (  —  1).  The  cooperation  among  the  units  is 
realized  by  setting  the  transition  rates  between  two  states  to  the  time-dependent  form: 

gu{t)=ge''^   (2) 

and 

g2^{t)=ge-''^   (3) 

where  M  denotes  the  total  number  of  nearest  neighbors,  which  in  the  case  of  a  two-dimensional  lattice  results  in  M  =  4. 
Mi(t)  and  Mjit)  are  the  nearest  neighbors  who  are  making  the  decision  "yes"  and  "no",  respectively.  The  single  unit  in 
isolation,  K  =  0,  fluctuates  between  "yes"  and  "no",  with  the  rate  g.  When  coupling  constant  K  >  0,a  unit  in  the  state  "yes" 
("no")  makes  a  transition  to  the  state  "no"  ("yes")  faster  or  slower  according  to  whether  M2  >  Mi  (Mi  >  M2)  or  M2  <  Mi 
(Ml  <  M2),  respectively.  The  quantity  Kc  is  the  critical  value  of  the  control  parameter  K,  at  which  point  a  phase-transition 
to  a  global  majority  state  occurs.  It  can  be  shown  that  for  a  lattice  of  infinite  size  Kc  =  2  ln(l  +  ^2)  ^  1.7627. 

All  simulations  are  implemented  on  a  lattice  of  N  =  100  x  100  nodes  with  periodic  boundary  conditions.  In  a  single  time 
step,  a  run  over  the  entire  lattice  is  performed  and  for  every  node  an  appropriate  transition  rate  (gi2  or  g2i)  is  calculated 
according  to  which  a  node  is  given  possibility  to  change  its  state.  Under  those  conditions,  setting  parameter  g  =  0.01,  the 
phase  transition  to  the  global  majority  case  occurs  at  Kc  ~  1.70.  To  study  the  dynamically  induced  network  topology,  we 
considered  the  DM  model  with  this  critical  value  of  coupling  constant.  After  an  initial  10^  time  steps,  we  record  2000  lattice 
configurations,  obtaining  the  dynamics  of  each  node  {Si(t)}.  In  the  next  step,  we  evaluate  the  linear  correlation  coefficient 
between  the  z-th  and  the  j-th  node: 

{Si(t)Sj(t))-{Si(t)){Sj(t)) 

r(hj)  =    ,  (4) 


y{s2(t))  -  (Si(t)y^(sj(t))  -  (Sj(t)y 


where  {•  •  •)  stands  for  the  time  average.  Nodes  i  and  j  of  the  dynamically  induced  network  are  connected  by  a  link  when  the 
correlation  between  nodes  i  and  j  of  the  two-dimensional  lattice  is  greater  than  a  given  positive  threshold  p+. 

We  investigated  a  wide  range  of  positive  thresholds  p+  =  [0.10  :  0.90].  As  expected,  correlation  networks  constructed 
with  increasing  value  of  the  threshold  p+  are  characterized  by  a  decreasing  number  of  links,  which  results  in  significant 
changes  in  the  corresponding  degree  distribution.  Networks  obtained  with  low  values  of  the  threshold  p+  result  in  narrow, 
peaked  degree  distributions  and  relatively  high  values  of  the  mean  degree  {/<).  Topologies  resulting  from  very  strong 
correlations,  however,  are  characterized  by  a  distribution  of  links  that  has  a  mean  close  to  the  minimum  value  of  {/<)  =  1 
and  drops  off  very  rapidly.  Interestingly,  in  between  those  conditions  there  exists  a  correlation  threshold  that  creates  a 
dynamically  induced  network  with  a  power-law  distribution  of  degrees  that  is  shown  in  Fig.  1.  The  connectivity  of  a  network 
obtained  under  this  particular  condition  is  inspected  and  only  nodes  that  are  part  of  a  giant  component  are  considered  to  be 
a  part  of  new  topology.  The  resulting  network  consists  of  6740  nodes  with  a  mean  degree  of  {/<)  =  38.  For  a  correlation 
threshold  p+  =  0.61,  we  find  that  the  distribution  of  links  follows  an  inverse  power  law  with  the  scaling  parameter 
V  =  1.20  ±0.23  (R^  =  0.9556). 

How  can  the  dynamical  origin  of  this  extremal  condition  be  explained?  From  an  intuitive  point  of  view  there  exists  a 
connection  between  the  scale-free  distribution  of  links  and  Zipfs  law  [8,9].  In  fact,  if  we  rank  the  nodes  moving  from  the 
richest  to  the  poorest,  according  to  Zipf  [10]  we  should  obtain  the  relation  between  the  connectivity  k  and  rank  r: 


k(r)  =  -, 


(5) 
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Fig.  1.  Degree  distribution  p(k)  for  correlation  network  obtained  with  a  positive  threshold  p+  =  0.61.  The  line  follows  estimated  power-law  relation  with 
the  exponent  of  v  =  1.20. 


with  the  power-law  index  a  being  very  close  to  1.  If  we  select  randomly  a  given  rank  r  with  probability  7r(r)dr  of  being  in 
the  interval  [r,  r  +  dr],  this  should  be  equal  to  the  probability  p(/<)d/<.  of  having  k  links  in  the  interval  [/<,  k  +  d/<], 

p(/<)d/<  =  7r(r)dr.  (6) 

Assuming  that  n  (r)  is  constant,  we  obtain  after  some  algebra 

p(/<)oc-i^,  (7) 
and  thus  the  two  power-law  indices  are  related  by 

v  =  l  +  -.  (8) 

This  leads  us  to  conclude  how  an  efficient  complex  network  is  organized  if  Zipf  s  law  applies.  For  example,  this  suggests 
that  the  intelligence  of  human  brain  should  be  characterized  by  v  =  2.  The  recent  research  work  of  Fraiman  et  al.  [2]  seems 
to  confirm  this  prediction.  The  authors  of  Ref.  [2]  studied  the  two-dimensional  Ising  model  at  criticality  and  using  a  suitable 
threshold  p+  established  a  link  between  the  nodes  with  a  dynamical  correlation  exceeding  that  threshold.  The  interesting 
result  of  that  procedure  was  the  emergence  of  the  scale-free  distribution  density  of  Eq.  (1)  with  a  power  index  v  that,  with 
the  proper  choice  of  average  connectivity  {/<).  gives  values  close  to  v  =  2  [3].  It  is  remarkable  that  the  same  procedure, 
applied  to  the  brain,  yields  results  very  close  to  those  afforded  by  the  Ising  model  [2]. 

It  is  also  interesting  to  notice  that  Boettcher  and  Percus  [11]  applied  the  method  of  extremal  optimization  to  a  Ising- 
like  model  to  evaluate  the  rank  distribution  of  Eq.  (5)  and  found  that  when  the  number  of  units  tend  to  infinity  the  rank 
distribution  tends  to  a  =  1.  This  result  implies  on  the  basis  of  the  earlier  remarks  that  the  cooperation  of  units  in  an  Ising- 
like  model  is  expected  to  confirm  Zipf  s  prediction  and  consequently  v  =  2.  For  results  confirming  this  prediction,  see  also 
Refs.  [12.13]. 

We  have  found,  therefore,  that  the  DM  model  generates  a  scale-free  network  at  criticality.  but  with  v  approaching  1 
which  violates  Zipf  s  condition  of  a  =  1.  The  emergence  of  a  1  corresponds  to  the  number  of  links  k  dropping  very 
quickly  when  we  move  from  r  =  1  to  subsequent  ranks. 


3.  Embedding  scale-free  networks  in  a  two-dimensional  regular  lattice 


The  efficiency  of  a  network  has  been  mainly  studied  on  the  basis  of  topological  length  L  [6].  The  topological  distance 
between  two  nodes  is  the  minimum  number  of  steps  needed  to  move  from  one  node  to  another.  The  quantity  L  is  the 
corresponding  mean  value  of  the  minimum  topological  distance  between  nodes.  As  a  consequence,  the  emergence  of  scale- 
free  networks  can  be  explained  by  making  the  conjecture  that  real  networks  evolve  in  time  so  as  to  realize  maximal 
efficiency.  In  the  case  of  the  scale-free  network  this  efficiency  is  measured  by  the  minimal  value  of  L. 

In  2003.  Cohen  and  Havlin  [14]  determined  that  scale-free  networks  are  very  efficient.  They  calculated  that  networks 
with  N  nodes  have  a  topological  length  L  given  by 

L-ln(lnN),  (9) 

for  a  power-law  index  v  <  3.  which  is  smaller  than  in  the  AB  model. 

As  a  consequence  of  Eq.  (9).  the  topological  length  L  remains  small  even  if  N  is  very  large.  This  relation  between  L  and  N 
suggests  that  the  complex  networks  with  v  <  3  are  very  efficient. 

The  region  v  <  2  has  not  been  studied  by  many  authors.  As  an  interesting  example  of  earlier  work  in  this  region  we 
refer  to  the  work  of  Refs.  [15.16].  In  both  cases  the  authors  adopt  a  kind  of  generalization  of  the  perspective  of  Albert  and 
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Fig.  2.  Perception  length  P  as  a  function  of  v  for  ad  hoc  scale-free  networks.  The  circles  represent  networks  with  constant  (/<)  =  4.95 ;  the  squares  represent 
networks  with  constant  N  =  1313. 

Barabasi  [1],  namely,  they  grow  a  network  with  special  prescriptions  allowing  them  to  overcome  the  limiting  condition 
V  >  3  and  yielding  v  <  2. 

More  recently  there  has  been  some  interest  in  the  efficiency  of  hierarchical  networks  with  v  <  2  [17]  and  v  ~  1  [18,19]. 
These  networks  are  characterized  by  very  small  topological  lengths.  In  the  case  of  Ref.  [19],  for  N  ^  oo  the  authors  predict 
the  value  L  =  2.  Our  numerical  calculations  show  that  L  is  insensitive  to  v  in  the  range  1  <  v  <  2.  Thus,  there  are  good 
reasons  to  believe  that  the  prediction  of  Eq.  (9)  applies  with  no  significant  dependence  on  v. 

There  are  also  good  reasons  to  believe  that  the  dynamic  source  of  scale-free  networks  has  a  probabilistic  rather  than 
a  deterministic  origin,  as  we  shall  demonstrate.  The  work  of  Fraiman  et  al.  [2],  although  fitting  Zipfs  condition,  allows  us 
to  define  the  concept  of  perception  length,  which  is  then  used  to  shed  light  onto  the  condition  v  =  1.  The  nodes  of  a 
two-dimensional  regular  lattice,  where  each  unit  interacts  with  only  its  four  nearest  neighbors,  correspond  to  a  network 
whose  topological  efficiency  is  very  low.  However,  if  we  let  the  units  cooperatively  interact,  and  we  set  a  link  between  two 
nodes,  then  when  their  dynamical  correlation  is  large  enough,  the  resulting  dynamical  network  may  have  a  much  lower 
L.  We  record  the  Euclidean  distance  between  each  pair  of  correlated  nodes,  and  we  define  the  network  efficiency  as  the 
corresponding  mean  value.  More  precisely,  we  define  the  network  efficiency  by  means  of  the  quantity 


and  dij  is  the  Euclidean  distance  between  node  i  and  its  nearest  neighbor  j.  We  refer  to  P  as  the  perception  length  of  the 
system,  this  being  the  measure  of  the  network  efficiency  replacing  1  /L.  For  numerical  analysis,  the  perception  length  P  is 
calculated  as  follows. 

1.  Create  a  2-dimensional  square  lattice,  where  a  is  the  length  (in  units  of  the  lattice)  of  the  side  of  the  square,  and  it  is 
necessary  that       <  a. 

2.  Embed  the  network  on  the  lattice  by  randomly  assigning  the  network  nodes  to  lattice  points. 

3.  Using  periodic  boundary  conditions  for  the  lattice,  calculate  the  Euclidean  length  of  each  link. 

4.  These  lengths  are  then  used  to  calculate  P  in  accordance  with  Eq.  (10). 

This  procedure  was  then  applied  to  scale-free  networks  generated  according  to  the  method  of  Catanzaro  et  al.  [20]  with 
{/<)  =  4.95  and  values  of  v  ranging  from  1  to  2.  The  nodes  were  embedded  randomly  in  a  square  lattice  of  size  just  sufficient 
to  contain  the  network  with  the  largest  number  of  nodes.  The  perception  lengths  were  calculated,  and  the  results  show  that 
over  this  range  the  perception  length  increases  as  v  approaches  1  (Fig.  2). 

We  now  compare  the  sensitivity  of  the  conventional  network  efficiency  (defined  as  the  reciprocal  of  the  network  length 
1/L)  and  the  perception  length  P  to  the  variation  in  v.  These  results  are  shown  in  Fig.  3.  Note  that  the  perception  length  P  and 
1  /L  have  been  normalized  to  allow  the  presentation  in  a  convenient  scale.  These  results  show  that  the  perception  length,  as 
a  measure  of  network  efficiency,  is  more  sensitive  to  changes  in  v  than  the  conventional  1  /L. 

Applying  this  approach  to  the  DM  network  discussed  in  Section  2  above  yields  a  mean  perception  length  =  1382.  To 
provide  a  basis  for  comparison,  we  calculated  the  perception  length  for  a  comparably  sized  ad  hoc  scale-free  network,  also 
prepared  in  accordance  with  the  algorithm  in  Ref.  [20].  The  results  are  compared  in  Table  1,  which  shows  a  very  good 
agreement. 
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Fig.  3.  Network  efficiency  as  a  function  of  v.  In  both  figures,  the  circles  represent  the  normalized  perception  length,  and  the  squares  represent  the 
normalized  1  /L  Error  bars  are  not  shown;  standard  deviations  for  the  normalized  perception  lengths  were  very  small  (of  the  order  of  0.005). 


Table  1 

Comparison  of  perception  lengths  for  the  DM  network  and  a  comparable 
ad  hoc  scale-free  network. 

DM  network     Ad  Hoc  network 


Degree  distribution  exponent,  V  1.20  1.19 

Number  of  nodes,  N  6740  6744 

Number  of  links, /<  128,448  128,413 

Perception  length,  P  1382  1373 


4.  Concluding  remarks 

The  main  result  of  this  paper  is  the  proof  that  the  perception  length  P  defined  by  Eq.  ( 10)  is  more  sensitive  to  v  in  the  range 
1  <  V  <  2,  and  may  therefore  be  a  more  useful  measure  of  network  efficiency  than  the  topological  length  L  [6].  This  new 
definition  is  based  on  the  assumption  that  the  scale-free  networks,  even  those  derived  using  deterministic  and  hierarchical 
arguments  [17-19],  are  compatible  with  the  dynamical  derivation.  The  apparent  conflict  between  the  two  kinds  of  network 
is  explained  by  noticing  that  the  correlation  between  different  nodes  evaluated  according  to  the  numerical  prescription 
of  Section  2  rests  on  time  windows  of  finite  size  and  moving  these  windows  to  different  time  regions  generate  different 
networks  with  the  same  hierarchical  topology.  An  alternative  but  probably  more  attractive  way  to  explain  this  is  that  the 
leadership  moves  in  time  from  some  nodes  to  others.  The  nodes  of  the  two-dimensional  regular  lattice  that  we  used  for  the 
dynamical  derivation  of  the  free-scale  distribution  of  Eq.  (1)  are  totally  equivalent,  and  the  rank  differences  between  the 
richest  and  the  poorest  nodes  are  temporary  and  in  the  long  time  run  the  same  high  rank  is  shared  by  all  the  nodes. 

There  are  some  questionable  aspects  of  the  dynamical  derivation  of  v  =  1,  realized  in  Section  2.  First  of  all  this  result 
depends  on  an  arbitrarily  selected  threshold  p+.  In  addition,  the  power  index  v  derived  from  the  data  in  Fig.  1  is  affected 
by  the  large  fluctuations  of  the  distribution  density  p(/<)  for  large  values  of  /<,  thereby  explaining  why  we  derive  v  =  1 
rather  than  v  =  2,  as  in  the  work  of  Ref.  [3].  However,  on  one  hand,  this  observation  led  us  to  the  discovery  of  the  new 
measure  of  network  efficiency  of  Eq.  (10)  and,  on  the  other  hand,  as  already  mentioned  in  Section  1,  the  DM  model  is  not 
quite  equivalent  to  the  Ising  model. 

Furthermore,  the  interesting  fact  that  a  =  \  (Zipf  s  law)  is  related  to  v  =  2  is  not  totally  convincing.  The  authors  of 
Ref.  [9]  established  a  connection  between  topological  and  dynamical  properties  by  running  a  random  walker  on  a  scale-free 
network  with  the  power  index  v.  They  found  that  the  waiting  time  distribution  density  for  returns  of  the  walker  to  a  given 
node  are  described  by  an  inverse  power  law  with  power  index  /x  =  3  —  v.  Using  a  generalized  central  limit  theorem,  they  also 
proved  that  the  same  index  /x  also  describes  the  inverse  power  law  form  of  the  probability  density  of  relative  frequencies 
P(/)  ^  f~^,so  that,  in  analogy  with  Eq.  (8),  a  rank-frequency  law/  ^  yields  /x  =  1  +  1/a,  hence  a  =  1/(2  —  v), 
namely  a  =  1  for  v  =  1.  It  is  interesting  to  note  that  v  =  1  yields  [1  =  2,  and  that  the  value  /x  ~  2  emerges  from  the  recent 
neurophysiological  work  of  Refs.  [21,22]. 

Thus,  in  addition  to  the  new  efficiency  measure  of  Eq.  (10),  which  is  the  unquestionable  result  of  this  paper,  we  are  also 
led  to  make  the  plausible  conjecture  that  the  DM  model  may  be  an  adequate  tool  to  study  the  emergence  of  intelligence 
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in  the  sense  pointed  out  by  Couzin  [23]  and  Cavagna  et  al.  [24].  According  to  these  authors  the  flocks  of  birds,  as  well  as 
the  brain  [25,26],  are  systems  at  criticality,  with  a  correlation  length  as  large  as  the  size  of  these  systems.  We  are  also  led 
to  make  the  conjecture  that  the  dynamical  generation  of  a  scale-free  network  with  v  =  1  may  be  the  manifestation  of  the 
principle  of  network  maximal  efficiency:  a  network  of  cooperatively  interacting  units  evolves  naturally  so  as  to  maximize 
P,  the  new  measure  of  network  efficiency. 
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Consensus  Networks 
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Abstract — Consensus  algorithms  have  generated  a  lot  of  interest 
due  to  their  ability  to  compute  globally  relevant  statistics  by  only 
exploiting  local  communications  among  sensors.  However,  when 
implemented  over  wireless  sensor  networks,  the  inherent  iterative 
nature  of  consensus  algorithms  may  cause  a  large  energy  con- 
sumption. Hence,  to  make  consensus  algorithms  really  appealing 
in  sensor  networks,  it  is  necessary  to  minimize  the  energy  neces- 
sary to  reach  a  consensus,  within  a  given  accuracy.  We  propose  a 
method  to  optimize  the  network  topology  and  the  power  allocation 
over  each  active  link  in  order  to  minimize  the  energy  consumption. 
We  consider  two  network  models:  a  deterministic  model,  where 
the  nodes  are  located  arbitrarily  but  their  positions  are  known, 
and  a  random  model,  where  the  network  topology  is  modeled  as 
a  random  geometric  graph  (RGG).  In  the  first  case,  we  show  how 
to  convert  the  topology  optimization  problem,  which  is  inherently 
combinatorial,  into  a  parametric  convex  problem,  solvable  with 
efficient  algorithms.  In  the  second  case,  we  optimize  the  power 
transmitted  by  each  node,  exploiting  the  asymptotic  distributions 
of  the  eigenvalues  of  the  adjacency  matrix  of  an  RGG.  We  further 
show  that  the  optimal  power  can  be  found  as  the  solution  of  a 
convex  problem.  The  theoretical  findings  are  corroborated  with 
extensive  simulation  results. 

Index  Terms — Consensus  algorithms,  minimum  energy  con- 
sumption, random  graphs,  sensor  networks,  topology  control. 


I.  Introduction 

AVERAGE  consensus  algorithms  have  received  consider- 
able attention  in  recent  years  because  of  their  ability  to 
enable  globally  optimal  decisions  using  only  local  exchange  of 
information  among  nearby  nodes  [l]-[3].  The  price  paid  for 
this  simplicity  and  the  underlying  decentralized  philosophy  is 
that  consensus  algorithms  are  inherently  iterative.  As  a  con- 
sequence, the  implementation  of  consensus  algorithms  over  a 
wireless  sensor  network  (WSN)  requires  an  iterated  exchange 
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of  data  among  the  nodes,  which  might  cause  an  excessive  en- 
ergy consumption.  This  must  be  contrasted  with  a  centralized 
strategy  where  there  is  a  sink  node  that,  after  collecting  all  the 
observations  from  the  sensors  (perhaps  over  multiple  hops),  is 
virtually  able  to  compute  the  desired  statistic  in  a  single  shot. 
Hence,  to  make  consensus  algorithms  practically  appealing  in 
a  sensor  network  context,  it  is  necessary  to  minimize  the  en- 
ergy consumption  necessary  to  reach  consensus.  Clearly,  the 
network  topology  plays  a  fundamental  role  in  determining  the 
convergence  rate  [4] .  It  is  well  known  that,  as  the  network  con- 
nectivity increases,  so  does  the  rate  of  convergence.  However, 
having  a  densely  connected  network  requires  a  high  power  con- 
sumption to  guarantee  reliable  direct  links  between  many  nodes. 
In  principle,  having  a  fully  connected  network  is  equivalent  to 
having  as  many  sink  nodes  as  sensors,  so  that  the  convergence 
time  of  fully  connected  networks  is  minimum.  However,  the 
power  consumption  necessary  to  maintain  a  fully  connected  net- 
work is  also  maximum.  On  the  other  hand,  a  minimally  con- 
nected network  entails  low  power  consumption  to  maintain  a 
few  links,  but,  at  the  same  time,  it  requires  a  large  convergence 
time.  Since  what  really  matters  in  a  WSN  is  the  overall  power 
spent  to  achieve  consensus,  this  paper  addresses  the  problem 
of  finding  the  optimal  network  topology  that  minimizes  this 
overall  power  consumption,  taking  into  account  convergence 
time  and  transmit  power  necessary  to  establish  reliable  links 
jointly.  The  search  for  the  optimal  topology  is,  per  se,  a  combi- 
natorial problem  whose  solution  becomes  prohibitive  even  for 
small  scale  networks.  Nevertheless,  we  will  present  a  relaxation 
technique  that  converts  the  problem  into  a  convex  problem,  with 
minimum  performance  loss. 

If  the  links  among  the  nodes  are  symmetric,  or,  equivalently, 
if  the  graph  describing  the  network  topology  is  undirected,  the 
convergence  rate  can  be  lower  bounded  through  the  so  called 
algebraic  connectivity,  defined  as  the  second  smallest  eigen- 
value of  the  graph  Laplacian  [5].  For  this  reason,  there  have 
been  works  aimed  at  maximizing  the  algebraic  connectivity  of 
a  given  undirected  graph  by  a  suitable  choice  of  the  weights  as- 
sociated to  each  edge  [6],  [7].  Alternatively,  in  [8]  it  was  shown 
how  some  network  topologies,  such  as  small  world  graphs,  can 
greatly  increase  the  convergence  rate.  In  [9],  the  authors  show 
that  nonbipartite  Ramanujan  graphs  constitute  a  class  of  topolo- 
gies maximizing  the  convergence  rate.  In  [10]  it  was  shown  how 
to  add  edges  from  a  given  set  to  a  graph  in  order  to  maximize  its 
algebraic  connectivity.  Other  works,  for  example  [11],  proposed 
strategies  to  improve  the  convergence  rate  of  gossip  algorithms 
through  geographic  routing. 
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However,  in  all  these  works,  the  focus  is  always  on  conver- 
gence time  and  there  is  no  cost  associated  with  the  establish- 
ment of  the  graph  topology.  Conversely,  in  our  work,  since  the 
graph  represents  a  real  network,  we  consider  as  the  optimiza- 
tion metric  the  power  consumption  necessary  to  maintain  re- 
liable communication  links  among  the  nodes,  taking  into  ac- 
count the  radio  propagation  model,  multiplied  by  the  number  of 
iterations  necessary  to  achieve  consensus.  From  this  perspec- 
tive, enforcing  a  small  world,  scale-free,  or  Ramanujan  graph 
topology  may  not  be  the  best  choice  for  a  wireless  network, 
whose  topology  should  depend  on  the  application  at  hand  as 
well  as  on  the  propagation  model. 

In  the  wireless  communication  network  context,  there  have 
been  works  on  optimizing  the  network  topology  in  order  to  min- 
imize the  power  consumption  necessary  to  guarantee  connec- 
tivity, e.g.,  [12],  [13].  These  works  concentrate  on  the  communi- 
cation task  and  do  not  make  any  specific  reference  to  the  running 
application.  However,  it  is  now  well  established,  in  the  sensor 
network  context,  that,  whenever  possible,  an  efficient  wireless 
sensor  network  design  should  take  into  account  the  specific  goal 
of  the  sensor  network  [14].  For  this  reason,  we  focus  on  the 
achievement  of  consensus  in  a  WSN.  It  is  well  known  that  one 
of  the  most  crucial  parameters  in  WSNs  is  energy  consumption, 
because  in  many  contexts  it  is  hard  to  recharge  the  batteries  or 
scavenge  energy  from  the  environment.  For  this  reason,  in  most 
applications,  minimizing  energy  consumption  is  more  appro- 
priate than  minimizing  convergence  time  (although,  there  are 
important  applications  where  the  latter  could  be  more  impor- 
tant). In  [15],  the  minimum  energy  consumption  problem  was 
studied,  assuming  a  common  transmit  power.  As  shown  in  [15], 
there  typically  is  an  optimum  power  that  minimizes  the  energy 
necessary  to  achieve  consensus  within  a  prescribed  accuracy. 

In  this  work,  we  generalize  the  initial  idea  suggested  in 
[15]  and  we  propose  a  method  for  optimizing  the  network 
topology  and  the  power  allocation  across  every  link  in  order 
to  minimize  the  energy  necessary  to  achieve  consensus.  We 
consider  two  classes  of  networks:  a)  deterministic  topologies, 
with  arbitrary,  but  known,  node  locations,  and  b)  random 
geometries,  with  unknown  node  locations,  modeled  as  random 
variables.  In  the  deterministic  case,  we  optimize  both  topology 
and  power  allocation.  Differently  from  [6],  we  do  not  assume 
any  prior  topology,  as  the  topology  comes  out  as  a  result  of  the 
optimization.  Topology  optimization  is,  in  general,  a  combina- 
torial problem  and  hence  an  NP-hard  problem  (recall  that  an 
undirected  graph  composed  of  n  nodes  may  assume  2^*^^"^)/^ 
topologies).  To  tackle  this  issue,  we  propose  a  relaxation  tech- 
nique that  allows  us  to  formulate  topology  optimization  as  a 
convex  parametric  problem.  Then,  we  show  that  the  effect  of 
this  relaxation  on  the  performance  is  negligible. 

In  the  random  topology  case,  where  the  intemode  distances 
are  unknown,  we  show  how  to  optimize  the  single  (common) 
transmit  power,  modeling  the  network  topology  as  a  random 
geometric  graph,  a  model  suitable  for  wireless  networks.  We 
provide  both  theoretical  and  simulation  results,  exploiting  the 
theory  of  the  eigenvalues  of  random  geometric  graphs. 

The  paper  is  organized  as  follows.  In  Section  II,  we  briefly  re- 
view the  consensus  algorithm.  In  Section  III,  we  introduce  our 
communication  model  and  formulate  the  optimization  problem. 


Section  IV  is  devoted  to  topology  optimization  for  arbitrary 
networks.  In  Section  V,  we  start  by  providing  a  closed  form 
expression,  albeit  approximate,  for  the  algebraic  connectivity 
of  a  random  geometric  graph.  Then  we  use  this  expression  to 
proceed  with  the  topology  optimization  for  random  geometric 
graphs.  The  analytical  findings  are  corroborated  with  extensive 
simulation  results. 

II.  Brief  Review  of  Consensus  Algorithms 

Let  us  consider  a  wireless  network  composed  of  n  sensors. 
The  network  topology  can  be  represented  as  an  undirected  graph 
G  =  {V^E}  where  V  denotes  the  set  of  n  vertices  (nodes) 
Vi  and  E  C  F  x  y  is  the  set  of  bidirectional  edges  (links) 
dj  =  Cji  connecting  Vi  and  Vj.  Furthermore,  let  A  be  the 
n  X  n-dimensional  symmetric  adjacency  matrix  of  the  graph  G, 
with  elements  aij  =  1  if  Cij  G  E  and  aij  =  0  otherwise.  Ac- 
cording to  this  notation  and  assuming  no  self-loops,  i.e.,  an  =  0^ 
Vz  =  1, . . . ,  n,  the  degree  of  node  Vi  is  defined  as  deg{vi)  = 

n  n 

S  ^ij  —  S  ^ji-  "^he  degree  matrix  D  is  defined  as  the  di- 

agonal  matrix  whose  diagonal  entry  is  da  =  deg{vi).  Let  Mi 
denote  the  set  of  neighbors  of  node  z,  so  that  \Afi\  =  degiviY. 
The  Laplacian  matrix  of  the  graph  C  is  the  n  x  n  symmetric 
matrix  L  :=  D  —  A,  whose  entries  are 

f    -f    _jdeg{vi)    iij  =  i 

Given  a  set  of  measurements  Xi{0),  collected  by  node  i,  for 
i  =  1, . . . ,  n,  the  goal  of  the  consensus  algorithm  is  to  allow 
every  node  to  compute  a  globally  optimal  function  of  the  mea- 
surements, say  :r* ,  through  a  decentralized  mechanism  that  does 
not  require  the  presence  of  a  sink  node,  but  builds  on  the  interac- 
tion among  nearby  nodes  only.  Let  us  consider,  for  simplicity, 
the  case  where  x'^  is  the  average  of  the  measurements.  In  this 
case,  reaching  an  average  consensus  can  be  seen  as  the  mini- 
mization of  the  disagreement  between  the  states  Xi  of  the  inter- 
acting nodes.  One  of  the  nice  properties  of  the  Laplacian  is  that 
the  disagreement  can  be  expressed  as  a  quadratic  form  built  on 
the  Laplacian  [16] 

1 

i=i  jeAfi 

^    n      n  ^ 
=  A^Yl  ^^^^^^  ~  ^^^^  =  9 

An  important  property  of  the  Laplacian  is  that  it  has,  by  con- 
struction, a  zero  eigenvalue,  whose  multiplicity  is  equal  to  the 
number  of  connected  components  of  a  graph.  Hence,  a  graph  is 
connected  if  its  zero  eigenvalue  has  multiplicity  one  [5].  Fur- 
thermore, if  the  graph  is  connected,  the  eigenvector  associated 
with  the  null  eigenvalue  is  the  vector  1,  composed  of  all  ones. 

The  minimization  of  the  quadratic  form  in  (2)  can  be  achieved 
using  a  simple  steepest  descent  technique.  In  continuous  time, 
the  minimum  of  (2)  can  be  reached  through  the  following  up- 
dating rule  [2] : 

x{t)  =  -Lx{t)  (3) 

^By  I  •  I  we  denote  the  cardinality  of  the  set. 
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initialized  with  x{0)  :=  xq^.  By  construction,  the  eigenvalues 
of  L  are  all  nonnegative.  Hence,  the  convergence  of  (3)  is  guar- 
anteed. In  particular,  if  the  graph  is  connected,  the  state  vector 
x{t)  in  (3)  converges  to  the  projection  of  the  initial  state  xq  onto 
the  nullspace  of  L,  spanned  by  the  vector  1,  i.e.,  [2] 


I 

X(t)   >  -1  l^XQ. 

t^oo  n 


(4) 


This  corresponds  to  having  all  the  nodes  converging 
to  a  consensus  on  the  average  of  the  initial  observa- 
tions X*  =  Yl^=i^i{^)l'^-  Moreover,  the  convergence 
rate  is  lower  bounded  by  the  slowest  decaying  mode, 
i.e.,  by  the  second  smallest  eigenvalue  of  L,  \2{L), 
also  known  as  the  algebraic  connectivity  of  the  graph. 
More  specifically,  if  the  graph  is  connected,  the  dynamic 
system  (3)  converges  to  consensus  exponentially  [2],  i.e., 
\\x{t)  -  <  ||a;(0)  -  :z;*l||0(e-^*),  with  r  =  \2{L). 

Defining  the  disagreement  vector  as  Ax(t)  =  x{t)  —  we 
can  write  ||Aj;(t)||  <  || Aa;(0)||(9(e~^*).  As  a  consequence, 
the  convergence  time  can  be  defined  (see  also  [2])  as  the 
time  necessary  for  the  slowest  mode  of  the  dynamical  system 
(3)  to  be  reduced  by  a  factor  7  <C  1,  i.e.,  the  time  for  which 
||Aj;(Tc)||  <  7||Aj;(0)||.  Hence,  we  can  set 


lQg(7) 
'  \2{L)' 


(5) 


III.  Optimization  Criterion 

It  is  well  known  that  one  of  the  most  critical  issues  in  WSN  is 
energy  consumption.  Neglecting  for  simplicity  the  power  spent 
for  processing  operations  with  respect  to  the  power  to  be  used 
to  enable  wireless  communications,  the  overall  power  spent  to 
reach  consensus  is  the  product  between  the  sum  of  the  power 
Ptot  necessary  to  establish  the  communication  links  among  the 
nodes  and  the  number  of  iterations  A^it  necessary  to  achieve 
consensus.  The  exchange  of  information  among  the  nodes  is 
supposed  to  take  place  in  the  presence  of  a  slotted  system,  with  a 
medium  access  control  (MAC)  mechanism  that  prevents  packet 
collisions.  Denoting  by  Tg  the  duration  of  a  time  slot  unit,  the 
number  of  iterations  is  then  approximately ^  A^it  =  Tc/Tg. 

Introducing  the  power  coefficients  pij^  ^  7^  J,  denoting  the 
power  used  by  node  i  to  transmit  to  node  j  with  pij  =  pji,  and 
the  binary  coefficients  aij  assessing  the  presence  {aij  =  1)  of  a 
link  between  nodes  i  and  j  or  not  (a^j  =  0),  the  power  spent  by 
the  whole  network  in  each  iteration  is  then  Ptot  =  j  ^ijPij- 
Using  (5),  our  goal  is  to  minimize  the  following  performance 
metric: 


(6) 


^The  discrete-time  counterpart  of  (3)  is  [2] 

x[k  +  1]  =  x[k]  -  eLx[k]  :=  Wx[k] 

where  e  is  a  parameter  chosen  so  as  to  ensure  that  no  eigenvalue  of  W  is  greater 
than  one  in  modulus. 

^We  neglect  rounding  errors,  which  tend  to  vanish  if  the  duration  of  the  up- 
dating time  slot  is  small  with  respect  to  the  convergence  time. 


where  K  incorporates  all  irrelevant  constants.  This  metric  is 
proportional  to  the  integral  of  power  consumption  with  respect 
to  time  and  then  it  represents  an  energy  consumption.  In  (6), 
we  have  made  explicit  the  dependence  of  the  Laplacian  L  on 
the  vector  a  =  A{:)  containing  all  binary  coefficients  aij,  since 
finding  these  coefficients  is  the  goal  of  our  optimization.  More 
specifically,  our  goal  is  to  find  the  set  of  active  links,  i.e.,  the 
nonzero  coefficients  aij,  and  the  powers  pij  that  minimize  (6), 
under  the  constraint  of  maintaining  network  connectivity,  which 
entails  X2{L(a))  >  0.  The  problem  can  then  be  formulated  as 
follows: 


mm 

S.t. 


En  ^—^n 
i=l  Z^j=l  ^ijPij 

e  <  A2(L(«)) 


[P.O] 


G{0,1} 


Pij  >  0    Vi,j  =  l,...,n 


(7) 


where  e  is  an  arbitrarily  small  positive  constant  used  to  prevent 
the  algebraic  connectivity  from  going  to  zero,  which  would  cor- 
respond to  a  disconnected  network  and  pis  the  vector  with  en- 
tries Pij . 

Since  the  topology  coefficients  are  binary  variables,  [P.O]  is 
a  combinatorial  problem,  with  complexity  increasing  with  the 
size  n  of  the  network  as  2^^^"^)/^.  Hence,  its  solution,  for 
medium/large  scale  networks  is  prohibitive.  Our  objective  is  to 
modify  [P.O]  in  order  to  turn  it  into  a  convex  problem,  with  neg- 
ligible performance  losses. 

A  first  simplification  comes  from  observing  that  the  coef- 
ficients aij  and  pij  are  not  independent  of  each  other.  Their 
dependence  is  indeed  a  consequence  of  the  radio  propagation 
model.  In  this  work,  given  the  complexity  of  the  topology  op- 
timization problem,  we  assume  a  fairly  simple  communication 
model.  We  state  that  there  is  a  link  between  nodes  i  and  j,  and 
then  aij  =  1,  if  the  signal-to-noise  ratio  SNRj  at  the  receiver 
node  j,  when  node  i  transmits,  exceeds  a  minimum  value  5min. 
i.e.,  SNRj  >  Smin-  If  we  denote  by  pRj  the  power  received  by 
node  j  when  node  i  transmits,  and  by  cr^  the  noise  power,  as- 
sumed for  simplicity  to  be  the  same  at  each  receiving  node,  we 
have  aij  =  1,  if  pRj  >  SminO'n  '—  Pmin,  or  otherwise  aij  =  0. 
Assuming  flat  fading  channel  modeling,  we  use  the  following 
propagation  model: 

Pij 


PRj 


1  +  (r^,/ro)^ 


(8) 


where  rij  is  the  distance  between  nodes  i  and  j,  and  r]  is  the 
path  loss  exponent.  The  parameter  vq  plays  the  role  of  a  scaling 
factor  or  reference  distance,  and  typically  corresponds  to  the  so 
called  Fraunhofer  distance,  such  that,  if  rij  ro,  the  receiver 
is  in  the  transmit  antenna  far-field,  where  the  received  power 
is  inversely  proportional  to  r^^  ;  conversely,  if  Vij  <C  ro,  the 
receiver  is  in  the  transmit  antenna  near-field,  where  the  received 
power  is  approximately  equal  to  the  transmitted  one.  The  unity 
term  in  the  denominator  of  (8)  is  used  to  avoid  the  unrealistic 
situation  in  which  the  received  power  could  be  greater  than  the 
transmitted  one.  Given  the  propagation  model  (8),  the  relation 
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between  the  power  coefficients  pij  and  the  topology  coefficients 


a,-i  is  then 


1    if  Pij  > 
0  otherwise 


(9) 


In  the  following  sections  we  will  show  how  to  relax  this  rela- 
tion in  order  to  simplify  the  solution  of  the  optimal  topology 
control  problem.  We  will  consider  two  scenarios:  a)  a  deter- 
ministic topology,  with  arbitrary  node  geometry,  where  the  dis- 
tances between  the  nodes  are  known,  and  b)  a  random  topology, 
where  the  positions  of  the  nodes  are  unknown  and  modeled 
as  random  variables.  In  case  a),  since  the  distances  among  the 
nodes  are  known,  we  optimize  the  power  allocated  to  each  link. 
This  strategy  is  also  instrumental  in  determining  the  topology, 
which  is  equivalent  to  finding  the  n(n  —  l)/2  entries  aij  of 
the  adjacency  matrix.  Conversely,  in  the  random  topology  case, 
since  the  distances  are  not  known,  we  assume  broadcast  com- 
munications, and  look  for  the  optimization  of  the  coverage  ra- 
dius of  each  node.  The  deterministic  topology  case  will  be  the 
subject  of  Section  IV,  whereas  the  random  case  will  be  studied 
in  Section  V. 

IV.  Optimal  Topology  and  Power  Allocation  for 
Arbitrary  Networks 

In  the  case  where  the  distances  between  the  nodes  are  known, 
the  optimization  criterion  amounts  to  solving  problem  [P.O], 
which  involves  a  combinatorial  strategy  that  makes  the  problem 
numerically  very  hard  to  solve,  especially  for  medium/large 
scale  networks.  The  relation  (9)  reduces  the  set  of  unknowns 
to  the  set  pij ,  but  the  problem  still  retains  most  of  its  difficul- 
ties. To  make  problem  [P.O]  to  be  numerically  tractable,  we 
introduce  a  first  relaxation  so  that,  instead  of  requiring  aij  to 
be  binary,  we  assume  aij  to  be  a  real  variable  belonging  to 
the  interval  [0,1].  Under  this  assumption,  problem  [P.O]  is 
not  combinatorial  anymore,  but  it  is  still  a  nontrivial  nonlinear 
constrained  problem.  The  first  important  contribution  of  this 
paper  is  to  propose  a  relaxation  technique  that  transforms  the 
previous  problem  into  a  convex  problem  that  can  be  solved  with 
well  established  and  efficient  numerical  tools.  We  achieve  this 
goal  by  first  introducing  the  following  relationship  between  the 
coefficients  aij  and  the  distances  Vij : 


(10) 


where  is  a  positive  coefficient  and  Tc is  the  coverage  radius, 
which  depends  on  the  transmit  power.  According  to  (10),  aij 
is  close  to  one  when  node  j  is  within  the  coverage  radius  of 
node  i,  i.e.,  Tij  <C  r^^,  whereas  aij  is  close  to  zero,  when 
^ij  Vcij .  The  switching  from  zero  to  one  can  be  made  steeper 
by  increasing  the  value  of  a. 

Given  the  propagation  model  (8),  the  coverage  radius  rc-^  is 
related  to  the  power  pij  and  the  minimum  power  required  for 
reliable  communication  Pmin^  as  follows: 


1/r? 


(11) 


Plugging  (1 1)  in  (10),  the  coefficients  aij  can  be  written  explic- 
itly in  terms  of  the  power  coefficients  as  follows: 


aij  —  ^iji.Pij^ 


r?(Ki-Pmi„)"/''  +  rf,Ki^ 


(12) 


This  relation  can  be  also  inverted  to  find  the  coefficients  pij  as 
a  function  of  ai j ,  as  follows: 


Pij  =  q{aij)  =  Pmin  +  h  I  _ 

\  I-  CLii 


7]/a 


(13) 


with  ki  =  Pmin^-  Expression  (12)  becomes  our  relaxed  ver- 
sion  of  (9)  and  it  allows  us  to  reduce  the  set  of  variables  to  the 
only  power  vector  p.  Consequently,  problem  [P.O]  can  be  re- 
laxed into  the  following  problem: 


mm 


S.t.      €<X2{L(p))  [P.l] 
Pminl  <  P 


(14) 


where,  thanks  to  (12),  the  Laplacian  is  now  written  explicitly 
in  terms  of  the  power  coefficients  pij .  In  principle,  the  last  in- 
equality in  (14)  makes  any  link  feasible.  But  this  does  not  imply 
that  the  final  network  will  be  fully  connected,  because  some  co- 
efficients aij  might  turn  out  to  be  equal  zero,  implying  that  the 
link  between  node  i  and  j  is  not  active.  The  first  important  re- 
sult, related  to  the  solution  of  (14),  is  the  following. 

Theorem  1:  Given  the  propagation  model  in  (8),  using  the 
relations  (12)  between  the  topology  coefficients  aij  and  the 
power  terms  pij,  problem  [P.l]  is  a  convex-concave  fractional 
problem  if    >  a. 

Proof:  Let  us  consider  the  objective  function  in  [P.l]. 
The  numerator  of  [P.l]  is  clearly  a  convex  function  of  p.  We 
only  have  to  prove  the  concavity  of  the  algebraic  connectivity 
\2{L{p))  with  respect  to  the  transmit  powers.  As  a  first  step, 
we  prove  that  aij{pij)  is  a  concave  function  of  pij.  Then,  we 
use  this  to  show  that  A2  ( • )  is  a  concave  function  of  p. 

(0  ciij(pij)  is  a  concave  function  of  pij\  We  compute  the 
second-order  derivative  of  the  function  in  (12)  with  respect  to 

Pij 

^  ^'^^^'^^  =  ksiia  -  r])k4  -  k2(a  +  rj)(pij 


(Ppi, 


■Pn 


(15) 


where  the  constants  k2 ,  ks  and  k^  are  given  by 

ak4k2{pij  -PminT^"^'^ 


h  =  ^0,  ^3  = 
—  ^ijPmin' 


[h  +  k2{p^J-Pmin)''^'^]^V^' 


Note  that  k2^k4  >  0;  since  pij  >  pmin  Vi,j,  we  also  have 
ks  >  0.  From  (15),  we  see  that,  if  <  r^,  the  second-order 
derivative  is  always  nonpositive  and  then  aij{pij)  is  a  concave 
function  of  pij . 

(//)  A2(  • )  is  concave  in  p:  Exploiting  the  properties  of  the 
Laplacian  [16],  we  can  write  the  quadratic  form  associated  to 
L  as  in  (2).  From  (12),  we  also  note  that  each  coefficient  aij 
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depends  only  on  the  corresponding  link  power  pij ,  and  not  on 
the  other  link  powers.  Given  any  pair  of  power  vectors  p^^^  and 
let  p  =         +  (1  -  P)P^'^\  with  0  <  /?  <  1.  Hence,  we 

have 

i=i  j=i 

^      n  n 

= ^  E  E  K'^  +  (1  -  (i)p^)  (-i  -  -if 


i=i  j=i 

n  n 


i=ij=i 


=  px^L  (a  (p^i)))  X  +  {l-p)x^L  (a  (p^^)^)  x 

(16) 

where  the  inequality  follows  from  the  concavity  of  aij  {pij ) . 

We  further  recall  that  the  algebraic  connectivity  A2  (L)  is  the 
solution  of  the  following  positive  semidefinite  program  (SDP) 
[5]: 

X^(L)  =     min     x^Lx.  (17) 

x±l,\\x\\=l 

Hence,  using  (16),  we  obtain 

A2  (L(/3pW  +  (l-/?y2))) 

>/3A2(i(pW))  +  (l-/3)A2  {l(p(^^)) 

where  the  last  equality  follows  from  a  majorization  theorem  for 
eigenvalues^.  Thus,  the  algebraic  connectivity  is  a  concave  func- 
tion of  the  vector  of  transmit  powers.  Hence,  (14)  is  a  convex- 
concave  fractional  problem  since  it  is  the  ratio  of  a  convex  and 
a  concave  function  of  p.  ■ 
Since  (14)  is  a  convex-concave  fractional  problem,  we  can  use 
one  of  the  methods  that  solve  quasi-convex  optimization  prob- 
lems, see,  e.g.,  [18],  [19].  For  example,  we  can  use  the  nonlinear 
parametric  formulation  proposed  in  [19].  To  do  so,  we  introduce 
the  following  function: 

h{^)  =  min{p^l  -  ^X2{L{p))  :peT} 

where  fi  is  a.  real  positive  parameter  and  T  =  {p  :  p  > 
IPmin?  A2(iv(p))  >  e}.  In  order  to  find  the  solution  of  this 
problem  we  could  use  the  following  result,  proved  in  [19]  and 
[20]. 

Theorem  2:  Let  f{x)  and  g{x)  be  continuous  real- valued 
functions  V:r  G  B,  where  6  is  a  nonempty  compact  subset  of 
W  and  g{x)  >  0      G  B.  Then 


^*  =  =  mm 


M 


^Let  A,  B  bQ  Hermitian  matrices,  and  let  C  =  A  +  B.  Let  their  eigen- 
values be  sorted  in  nondecreasing  order.  Then  the  vector  of  eigenvalues  A(C) 
majorizes  the  vector  A(A)  +  X{B).  Since  Ai(L)  =  0  for  the  graph  Laplacian, 
it  follows  that  A2( A -hS)  >  A2(A)  +  X2{B).  See  [17,  Theorem  4.3.27]. 


with  X*  G  B,  if,  and  only  if 

/i*  =  h{iJ.'',x*)  =  min{/(:2;)  -  iJ.*g{x)  :  x  G  G}  =  0 

where  /i(^*,:z;*)  means  that  for  jj,  =  jj,'^  the  minimum  of 
{f{x)  -  lJi'g{x)  \  X  GG}is  taken  ondXx  =  x"". 

Before  applying  Theorem  2,  it  is  useful  to  further  convert  the 
convex-concave  optimization  problem  [P.l]  into  the  following 
parametric  problem: 

min    p^l-  iJi\2{L{p)) 


s.t.    e<\2{L{p))  [P.2] 


(18) 


By  Theorem  1,  X2{L{p))  is  a  concave  function  of  p.  Hence,  the 
objective  function  in  [P.2],  as  sum  of  convex  functions,  is  a 
convex  function.  The  constraint  sets  are  convex.  Then,  problem 
[P.2]  is  a  convex  parametric  problem,  whose  solution  is  a  func- 
tion of  the  parameter  11  that  controls  the  tradeoff  between  the 
global  transmit  power  and  the  convergence  time.  Later  on,  we 
will  show  how  to  find  the  optimal  ji. 

Since  problem  [P.2]  is  convex,  it  can  be  solved  using  nu- 
merically efficient  convex  programming  tools.  However,  before 
applying  any  convex  tool,  it  is  worth  noticing  that  the  feasible 
set  in  [P.2]  is  not  compact.  Hence,  even  if  a  solution  exists,  in 
principle,  it  could  be  unreachable  in  finite  time.  To  overcome 
this  potential  drawback,  we  propose  next  an  alternative  formu- 
lation of  [P.2]:  Instead  of  looking  for  the  set  of  power  coeffi- 
cients Pij,  and  then  for  the  a^j,  using  (13),  we  can  reformulate 
[P.2]  so  as  to  look  directly  for  the  variables  aij.  Then  the  opti- 
mization problem  in  (18)  can  be  rewritten  in  terms  of  vector  a, 
as  follows: 


min    (f){a)  —  /iX2{L{a)) 

a 

S.t.    6  <  X2{L{a))  [P.3] 
0  <  a  <  1 


(19) 


where  (^(a)  =  X^^^^         Qi^^ij)-  We  verify  next  that  this 

problem  is  still  convex.  To  study  the  behavior  of  (p{a),  we  com- 
pute the  second-order  derivative  of  q{aij),  obtaining 


We  infer  that 


a  \1  — 


r]/a-2 


{l-aij}'^  \a  ) 


(Pq{aij) 


>0^rj-a-\-  2aaij  >  0. 


We  note  that,  if  7^  >  o^,  then  ^-^^  >  0  for  0  <  aij  <  1, 
SO  that  (j){a),  as  a  sum  of  convex  functions,  is  convex^.  Finally, 
the  algebraic  connectivity  X2{L{a))  is  a  concave  function  of 
a,  as  can  be  proved  following  the  same  steps  as  in  Theorem 

^Note  that  (f){a)  is  the  sum  of  functions  of  the  single  variables  aij .  Hence  the 
convexity  of  0(a)  can  be  studied  looking  at  the  convexity  of  the  single  functions 

q{aij). 
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1.  Then,  the  optimization  problem  [P. 3]  is  also  a  convex  para- 
metric problem,  perfectly  equivalent  to  the  original  problem  in 
(18),  since  the  change  of  variables  pij  =  q{aij)  in  (13)  ensures 
a  one-to-one  mapping  q  :  R  [R,  for  0  <  aij  <  1,  with 
image  covering  the  problem  domain  in  (18)  (see  [21,  p.  130]). 
Hence,  assuming  0  <  a  <  1  —  e  ,  with  e  an  infinitesimal  posi- 
tive constant  so  that  the  feasible  set  of  (19)  T  =  {a  :  0  <  a  < 
1  — e  ,  A2  (iv(a))  >  e}  is  a  compact  convex  set  in  we 
ensure  that  the  set  of  minima  of  (19)  is  nonempty  and  by  the  con- 
vexity of  the  problem  we  can  deduce  that  all  local  minima  are 
also  global.  Note  that  since  the  optimization  problem  in  (19)  is 
convex  an  optimal  solution  can  be  found  via  efficient  numerical 
tools.  Furthermore,  using  Dinkelbach's  algorithm  [19],  based 
on  Theorem  2,  we  are  also  able  to  find  the  optimal  parameter  /j. 
in  [P. 3].  More  specifically,  the  Dinkelbach's  algorithm,  applied 
to  our  problem,  proceeds  through  the  following  steps: 

1.  Set  i  =  1  and  let  ai  be  a  feasible  point  of  T,  with  /j^i  = 

2.  Set  jj.  =  jj^i  and  find  a^+i  G  T  that  solves  the  minimization 
problem  in  [P. 3]; 

3.  If  I  h{ii,aij^i)  1  =  1  (l){aij^i)- ii\2{L{ai  j^i))  \<  e'',  withe'' 
an  arbitrarily  small  positive  constant,  stop  and  take  a^+i  as 
the  optimal  link  coefficient  vector;  otherwise,  set  z  =  z  + 1, 

Since  the  topology  coefficients  aij  obtained  in  this  way  are 
real  variables  belonging  to  the  interval  [0, 1],  to  obtain  the  net- 
work topology,  it  is  necessary  to  quantize  them  to  convert  them 
into  binary  values,  1  or  0,  indicating  the  presence  or  absence 
of  a  link.  This  quantization  is  achieved  by  comparing  each  aij 
with  a  threshold  ath-  Of  course,  the  final  topology  will  depend 
on  the  threshold  value.  Moreover,  the  thresholding  operation 
will  also  affect  the  final  result  in  terms  of  convergence  time 
and  energy  consumption.  It  is  then  of  interest  to  check  how 
sensitive  the  final  topology,  as  well  as  convergence  time  and 
energy  consumption,  are  to  the  choice  of  ath-  In  the  ensuing 
section,  we  present  some  numerical  results  to  shed  light  on  the 
resulting  topologies  and  their  dependence  on  the  propagation 
model  parameters. 

A.  Numerical  Examples 

Since  our  optimization  procedure  is  based  on  a  relaxation 
technique,  the  first  important  step  is  to  evaluate  the  impact  of 
the  relaxation  on  the  final  topology  and  performance. 

Example  1:  Comparison  Between  Exhaustive  Search  and  Re- 
laxed Technique:  We  compare  now  the  topology  obtained  as  a 
solution  of  the  relaxed  problem  [P. 3]  with  the  optimal  graph 
obtained  by  solving  directly  problem  [P.O]  using  an  exhaustive 
search  over  all  possible  topologies.  For  complexity  reasons,  of 
course  we  can  only  perform  this  comparison  for  small  scale  net- 
works. We  consider  networks  of  n  =  4  and  6  nodes. 

To  provide  results  not  conditioned  to  a  specific  geographic 
node  deployment,  we  averaged  the  results  over  100  statistically 
independent  realizations  of  the  nodes  locations.  In  each  iteration 
we  compute  the  minimum  energy  <f  op  reached  using  the  optimal 
exhaustive  search  over  all  possible  topologies,  and  the  energy 
Er  corresponding  to  the  network  topology  whose  coefficients 
aij  are  obtained  by  solving  problem  [P. 3]  and  thresholding  the 
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Fig.  1.  Optimal  average  energy  obtained  with  exhaustive  search  and  average 
energy  obtained  by  solving  the  relaxed  problem  [P. 3]  versus  77,  for  different  n 
values. 
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Fig.  2.  Ratio  between  the  optimal  energy  obtained  with  exhaustive  search  and 
the  energy  obtained  by  solving  the  proposed  relaxed  problem  [P .  3] ,  versus  ath , 
for  different  values  of  n . 


result  with  the  threshold  ath  =  2  x  10~^.  In  Fig.  1,  we  report  the 
average  energies  and  Er  versus  r].  We  can  note  from  Fig.  1 
that  the  loss  in  terms  of  optimal  energy  due  to  the  relaxation  of 
the  original  problem  is  negligible  (the  energy  loss  is  less  than 
1.7  dB). 

Example  2:  Impact  of  Thresholding  Operation  on  Final 
Topology:  Clearly,  the  selection  of  the  threshold  ath  plays  a 
role  in  the  identification  of  the  final  topology.  To  evaluate  the 
impact  of  ath  on  the  final  topology,  in  Fig.  2  we  report  the 
ratio  ^  versus  ath  for  different  numbers  of  nodes  n  and  for 
?7  =  5.  The  energies  are  averaged  over  100  independent  node 
realizations.  We  can  observe  from  Fig.  2  that  there  is  a  wide 
range  of  values  of  ath  such  that  the  energy  loss  is  practically 
independent  of  ath-  This  shows  that  our  proposed  procedure  is 
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rather  robust  with  respect  to  the  choice  of  ath-  It  is  now  inter- 
esting to  check  the  effect  of  the  threshold  ath  on  the  topology, 
considering  larger  scale  networks  (in  this  case,  we  can  only  run 
our  algorithm,  as  the  exhaustive  search  is  not  feasible).  As  an 
example,  in  Fig.  3  we  show  the  topologies  obtained  by  solving 
problem  [P. 3],  for  a  network  composed  of  n  =  30  nodes,  using 
different  values  of  ath,  for  =  6.  Comparing  the  four  cases 
reported  in  Fig.  3,  we  notice  that,  only  for  very  low  values  of 
the  threshold  [i.e.,  case  (d)],  we  appreciate  a  sensitive  change 
of  topology,  whereas  for  a  large  range  of  values  of  ath,  the  final 
topology  is  practically  the  same.  This  means  that  the  proposed 
algorithm,  in  spite  of  the  relaxation  step  and  the  subsequent 
quantization,  yields  rather  stable  solutions. 

The  previous  results  pertain  to  a  specific  realization  of  the 
node  locations.  To  provide  results  of  more  general  validity,  in 
Fi^.  4,  we  report  the  average  value  of:  a)  fraction  of  active  links 

^^u^lr)  ^ '  ^)  {L) ;  and  c)  the  average  energy  as  a  function 
of  tne  threshold  ath-  The  averages  are  carried  out  over  100  inde- 
pendent realizations  of  the  nodes  location.  From  Fig.  4,  we  ob- 
serve that  there  is  an  interval  of  values  of  ath  (roughly,  between 
10~^  to  10~^)  for  which  we  obtain  a  strong  reduction  in  the 
fraction  of  active  links,  with  respect  to  the  situation  where  there 
is  no  threshold,  still  achieving  nearly  the  same  performance,  in 
terms  of  algebraic  connectivity  and  energy  consumption.  This 
is  indeed  an  important  result,  as  it  shows  that  the  relaxed  algo- 
rithm is  weakly  sensitive  to  the  choice  of  ath- 
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Fig.  4.  Average  value  of  (a)  fraction  of  active  links,  (b)  A2(L).  (c)  energy 
versus  threshold  value  fox  rj  —  6 . 


Example  3:  Impact  of  Propagation  Parameters  on  Final 
Topology:  It  is  also  interesting  to  look  at  the  change  in 
topology  as  a  function  of  the  radio  communication  model.  To 
this  end,  in  Fig.  5,  we  plot  the  optimal  topologies  achieved  for 
the  same  node  locations  as  in  Fig.  3,  but  pertaining  to  different 
path  loss  exponents  ry,  for  a  given  threshold.  Interestingly,  we 
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Fig.  5.  Optimal  topologies,  for  different  path  loss  exponents  /y  and  fixed  threshold  (at 


:  0.09):  (a)  /y  =  3.  (b)  /y  =  4.  (c)  /y  =  5.  (d)  /y  =  6. 


notice,  as  expected,  that,  as  the  propagation  loss  increases  (i.e., 
T]  increases),  the  network  tends  to  become  more  and  more 
sparse. 

The  results  shown  in  Fig.  5  refer  to  a  single  realization  of  the 
node  positions.  To  draw  conclusions  of  more  general  validity, 
we  averaged  over  100  statistically  independent  realizations  of 
the  node  locations,  for  networks  of  30  nodes.  In  Fig.  6,  we  report 
the  same  performance  metrics  as  in  Fig.  4,  but  now  as  a  function 
of  the  path  loss  exponent  ry,  setting  ath  =  0.09.  From  Fig.  6, 
we  observe  that  when  the  attenuation  is  high  (i.e.,  r]  is  large), 
reducing  the  number  of  links  (making  the  topology  sparser)  is 
more  important  than  reducing  convergence  time.  Conversely, 
when  the  attenuation  is  low  (i.e.,  rj  is  small),  increasing  network 
connectivity  is  more  important  than  reducing  power  consump- 
tion. This  behavior  sounds  reasonable  and  in  agreement  with 
intuition. 

V.  Optimal  Topology  for  Random  Geometric  Networks 

In  this  section,  we  remove  the  assumption  that  the  node  lo- 
cations are  known  a  priori  and  model  the  network  as  a  random 
geometric  graph  (RGG).  In  such  a  case,  the  graph  connectivity 
properties  and  the  convergence  time  can  only  be  established  in 
a  probabilistic  sense,  asymptotically,  as  the  number  of  nodes 
tends  to  infinity.  We  refer  to  [22]  for  the  first  basic  result  about 
the  convergence  of  consensus  algorithms  over  random  graphs 
and  to  [23]  for  a  more  recent  generalization  of  the  convergence 


Fig.  6.  Average  value  of  (a)  fraction  of  active  links,  (b)  X2{L).  (c)  energy 
versus  path  loss  j]  for  ath  =  0.09. 


conditions  under  stochastic  disturbances.  As  shown  in  [22], 
the  rate  of  convergence  of  consensus  algorithms  in  the  random 
graph  case  is  dictated  by  the  expected  value  £^[e~^^^'^2(^)] 
In  the  following,  exploiting  the  concentration  properties  of  the 
eigenvalues  of  RGGs  [24],  [25],  we  will  show  how  to  relate 
the  convergence  time  to  the  expected  value  of  A2  (L) .  This  link 
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will  be  fundamental  to  derive  the  optimal  coverage  radius,  and 
then  transmit  power,  that  minimize  the  energy  consumption 
necessary  to  achieve  consensus  over  RGGs.  To  achieve  this 
goal,  it  is  fundamental  to  recall  and  extend  some  results  about 
the  spectrum  of  RGGs. 

A.  RGGs 

A  random  graph  is  obtained  by  distributing  n  points  randomly 
over  the  (i-dimensional  space  and  connecting  the  nodes  ac- 
cording to  a  given  rule.  Let  Vn  =  {xi^X2^ ...  be  a  set 
of  (i-dimensional  vectors  Xi,  belonging  to  a  compact  set  Q  = 
[—1/2, 1/2]^  of  IR^,  whose  entries  denote  the  coordinates  of  the 
nodes  in  [R^,  and  let  ||  •  ||  be  the  Euclidean  norm  on  U^.  The 
graph  topology  is  captured  by  the  adjacency  matrix  A  which, 
in  this  case,  is  a  random  matrix.  An  important  class  of  random 
matrices,  encompassing  the  adjacency  matrix  of  our  problem, 
is  the  so-called  Euclidean  Random  Matrix  (ERM)  class,  intro- 
duced in  [26] .  Given  a  set  of  n  points  located  at  positions  Xi^i  = 
1,. ..,n,  an  nxn  adjacency  matrix  A  is  an  ERM  if  its  generic 
(i,  j)  entry  depends  only  on  the  difference  X2,  J  ^  I.e.,  — 
F{xi  —  Xj),  where  F  is  a  measurable  mapping  from  [R^  to  [R. 
An  important  subclass  of  ERM  is  given  by  the  adjacency  ma- 
trices of  the  so-called  RGGs.  In  such  a  case,  the  entries  aij  of 
the  adjacency  matrix  are  either  zero  or  one  depending  only  on 
the  distance  between  nodes  i  and  j,  i.e. 

if  \\x^  —  xA\  <  r 


{I 


otherwise 


(20) 


where  r  is  the  coverage  radius.  This  is  a  particular  case  of  an 
ERM,  corresponding  to  having 


F{x,-x,)  =  \^    if  11^.-^. 

1 0  otherwise 


(21) 


We  will  use  the  symbol  G(n^r)  to  indicate  an  RGG  composed 
of  n  points,  with  coverage  radius  r. 

The  RGG  model  is  the  most  appropriate  to  capture  the 
topology  of  a  wireless  network,  as  it  basically  states  that  there 
is  a  link  between  two  nodes  only  if  they  are  within  the  coverage 
radius  of  each  other^. 

Next,  we  recall  some  of  the  most  important  properties  of 
RGG's,  in  terms  of  connectivity  and  spectrum,  as  they  are  rele- 
vant to  our  optimization  problem. 

1 )  Connectivity:  Some  interesting  results  on  the  asymptotic 
connectivity  of  random  geometric  graphs  have  been  derived  in  a 
seminal  work  by  Gupta  and  Kumar  [27]  who  proved  that,  given 
a  set  of  n  points  uniformly  distributed  within  a  unit  square  (i.e., 
d  =  2),  the  graph  is  connected  almost  surely  if  the  coverage 
radius  behaves  as 


ro(n)  = 


log(n)  +  Cn 


7rn 


with  ^  00,  as  n  ^  00.  Conversely,  if  —00,  the  graph 
is  disconnected  almost  surely.  This  means  that  the  expression 

represents  a  threshold  distance.  In  the  following,  we 


log(^) 


^In  practice,  a  wireless  channel  is  also  affected  by  fading,  multipath,  and  shad- 
owing. Hence,  the  presence  of  a  link  between  two  nodes  depends  on  two  sources 
of  randomness:  distance  and  fading.  In  this  work,  we  concentrate  on  the  single 
source  of  randomness,  given  by  the  distance  between  the  nodes,  but  considering 
both  sources  of  randomness  will  be  an  interesting  extension  of  this  paper. 


will  often  use  the  shorthand  notation  r+(n)  to  indicate  the  law 
^iog(n)+cr^  ^j^j^  _^  00,  as  n  00.  Hence,  a  coverage 
radius  ro(n)  behaving  as  r+(n)  represents  a  law  that  ensures 
connectivity  with  high  probability,  as  n  ^  00.  We  will  use  the 
notation  ro(n)  ~  r+(n)  to  indicate  such  a  behavior. 

In  [4],  [28],  it  has  been  shown  that  the  degree  of  a  RGG 
G(n^r)  of  points  uniformly  distributed  over  a  two-dimensional 
unit  torus'^  is  equal  to 


d{n) 


(22) 


with  high  probability,  i.e.,  with  probability  1  —  1/n^ ,  if  the  radius 
behaves  as  ro(n)  ~  r+(n).  This  implies  that  an  RGG  tends  to 
behave,  asymptotically,  as  a  regular  graph,  if  the  coverage  radius 
is  chosen  so  as  to  guarantee  connectivity  with  high  probability. 

We  are  primarily  interested  in  the  second  eigenvalue  of  the 
Laplacian,  L  =  D  —  A,  where  D  is  the  degree  matrix  and  A  is 
the  adjacency  matrix  [see  (1)].  From  (22),  D  =  Trr^n/,  so  that 
we  only  need  to  investigate  the  second  largest  eigenvalue  of  A. 
Hence,  in  the  ensuing  section,  we  study  the  spectrum  of  A. 

2)  Spectrum  of  a  Random  Geometric  Graph:  In  [24],  [25], 
it  is  shown  that  the  eigenvalues  of  the  adjacency  matrix,  or  of 
the  transition  probability  matrix  ^,  tend  to  be  concentrated,  as 
the  number  of  nodes  tend  to  infinity.  In  particular,  in  [24]  it  is 
shown  that  the  eigenvalues  of  the  normalized  adjacency  matrix 
An  —  Ajn  of  an  RGG  G(n,  r),  composed  of  points  uniformly 
distributed  over  a  unit  bidimensional  torus,  tend  to  the  Fourier 
series  coefficients  of  the  function  F  defined  in  (21) 


-I 


exp  {—27rjz^x)dx 


(23) 


almost  surely,  for  all  z  =  [^1,^2]  ^  where  =  {x  = 
[xi^X2]'^  G  IR^  :  <  r}.  Using  polar  coordinates,  i.e., 

xi  =  p  sin  0  and  X2  =  p  cos  0,  with  0  <  p  <  r  and  0  <  ^  <  27r, 
we  obtain 


F{z)  =        /     exp  {—27rjp{zi  sin  0  -\-  Z2  cos  0))pdpdO. 
Jo  Jo 

This  integral  can  be  computed  in  closed  form.  Setting  zi  = 

A  sin  ^  and  Z2  —  A  cos  ^,  we  have 

F(yl,^)=  /    /  exp(-27rj>ylcos(^))p(ip(i^ 
Jo  J -4) 

with  ^  —  Q  —  (\).  Furthermore,  using  the  integral  expression 
for  the  Bessel  function  of  the  first  kind  of  order  k,  Jk{x)  = 
^  Jl^  exp  {jx  sin(0  -  jk^)d^,  we  get 

F{A,  (/))  =  F{A)  =  271  Jo{27rpA)pdp. 
Jo 

Finally,  using  the  identity  vJo(v)dv  =  uJi  (u),  we  can  make 
explicit  the  dependence  of  F{A)  on  the  index  pair  [^1, 2^2] 

F{zuZ2)  =  -r^^  Ji  (2^r^zl  +  zi\  .  (24) 

torus  geometry  is  typically  used  to  get  rid  of  border  effects. 

^The  transition  probability  matrix  is  the  adjacency  matrix,  normalized  with 
respect  to  the  node  degree,  so  that  the  row  of  the  adjacency  matrix  is  divided 
by  the  degree  of  node  i. 
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Fig.  7.  Verification  of  the  inequalities  in  (26). 


This  formula  allows  us  to  rank  the  eigenvalues  of  =  A/n. 
In  particular,  we  are  interested  in  the  second  largest  eigen- 
value of  An.  Considering  that  the  minimum  coverage  radius 

ensuring  connectivity  behaves  as  r(n)   ~    \/^^^^'  i-^-' 
is  a  vanishing  function  of  n,  we  can  use  the  Taylor  series 
expansion  of  ^(^1,^2),  for  small  r.  Recalling  that,  for  small 
X,  Ji{x)  =         —  +  o{x^),  we  can  approximate  the 

eigenvalues  as 

F{z,,z,)  =  Trr^  -  ^'('i+'D^'  +  oir%  (25) 

This  expansion  shows  that,  at  least  for  small  r,  the  largest  eigen- 
value equals  irr^  and  occurs  at  zi  =  Z2  =  0,  whereas  the  second 
largest  eigenvalue  corresponds  to  the  cases  {zi  =  1,2^2  =  0) 
and  {zi  =  0,^2  =  !)•  More  generally,  we  can  check  numeri- 
cally that,  for  r  <  1/2  and  A  >  1,  the  following  inequalities 
hold  true: 

Trr^  >  rJi (27rr)  >  ^\Ji {27rrA) \ .  (26) 

The  validity  of  these  inequalities  can  be  verified  from  Fig.  7, 
which  shows  the  three  terms  in  (26)  as  a  function  of  r  and  A. 

In  summary,  denoting  the  spectral  radius  of  A^  as  Ci(^n)  = 
maxi<^<^  where  {Xi{n)}f^i  is  the  set  of  eigenvalues 

of  A,  it  follows  that 

lim  Ci(An)  =  max  |  F{z)  |=  F(0, 0)  =  vrr^  (27) 

while  the  second  largest  eigenvalue  of  A^^  (2{An),  converges 
to 

lim  C2(^n)  =  ^(1,0)  =  F(0,1)  =  rJi(27rr).  (28) 

n— ^oo 

We  are  now  able  to  derive  the  asymptotic  expression  for  the 
second  largest  eigenvalue  of  the  normalized  Laplacian  Ln  = 
Dn  —  An,  where  :=  Z>/n  is  the  normalized  degree  matrix. 
Because  of  the  asymptotic  property  of  the  degree  of  an  RGG, 


n  =  1024 


Fig.  8.  Asymptotic  analysis  and  simulation  results  for  some  eigenvalues  of 
and  for  A2(L^)  versus  the  transmission  radius. 

shown  in  (22),  the  second  largest  eigenvalue  of  tends  asymp- 
totically to 

X2{Ln)=7rr^-C2{An).  (29) 

Thus,  the  algebraic  connectivity  of  the  graph  can  be  approxi- 
mated, asymptotically,  as 

X2{L)  =  Trnr^  -  nrJi(27rr).  (30) 

Since  the  previous  expressions  have  been  derived  in  the 
asymptotic  regime,  in  Fig.  8  we  compare  the  analytic  formulas 
for  the  first  and  the  second  largest  eigenvalues  of  A^,  as  given 
in  (27)  and  (28),  with  the  numerical  results  obtained  by  aver- 
aging over  100  independent  realizations  of  RGG's  composed 
of  n  =  1024  nodes.  We  can  notice  the  good  agreement  in 
Fig.  8  between  the  theoretical  expression  for  the  algebraic 
connectivity  X2{Ln),  given  in  (29),  and  the  simulation  results. 

B.  Minimization  of  the  Energy  Consumption:  An  Analytic 
Approach 

We  can  now  exploit  the  previous  analytic  expressions  to  study 
the  energy  minimization  problem  for  RGG's.  In  the  random 
topology  case,  since  the  distances  are  unknown,  we  cannot  op- 
timize the  power  associated  with  each  link.  However,  we  can 
seek  the  common  transmit  power  that  minimizes  energy  con- 
sumption. Thus,  in  the  random  setting  we  assume  a  broadcast 
communication  model,  where  each  node  broadcasts  the  value 
to  be  shared  with  its  neighbors.  In  the  lack  of  any  information 
about  distances  among  the  nodes,  we  assume  that  each  node  uses 
the  same  transmit  power.  In  this  case,  the  network  topology  can 
be  modeled  as  a  random  graph  model.  It  can  be  shown  [22], 
[23]  that  the  system  in  (3)  converges  to  consensus  almost  surely, 
i.e.,  Pr{  lim  x{t)  =  x*l}  =  1  assuming  that  each  node  has  a 
coverage  radius  so  that  the  network  is  asymptotically  connected 
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with  probability  one  [14].  Then  the  rate  of  convergence  of  the 
dynamical  system  in  (3)  is  given  [22],  [23]  by  E'[e"^^^^2(l^)] 
Note  that  defining  /(A2)  =  e~^^^^2(^)  ^nd  using  the  Taylor 
series  expansion  of  /(A2)^  at  the  point  A2  =  ^[A2]  =  mx^,  we 
can  write 

/(A2)  =  /(mAj  +  f^'\mx,){X2  -  mx,) 
H  2  ^  2  -  mxj 

+f:^^^(^-™Aj^  (31) 


k=3 


Consequently,  taking  the  expected  value,  we  get 


(32) 


denoting  with  a^^  =  £'[(A2  —  171x2)'^]  the  variance  of  A2.  But 
since  all  central  moments  of  order  greater  than  one  of  the  eigen- 
values tend  to  zero,  because  of  the  concentration  property,  we 
can  use  the  approximation 

E[e-^^^^^]^e-^^^^^''^l  (33) 

As  a  consequence,  the  energy  spent  to  achieve  consensus  can 
now  be  approximated  as 


E  =  K 


np 


2E[X2{L{p))]' 


(34) 


This  is  the  performance  metric  we  wish  to  minimize  in  the 
random  scenario,  with  respect  to  the  single  unknown  p. 

In  particular,  using  the  asymptotic  expression  (30)  for  the  al- 
gebraic connectivity,  we  can  introduce  the  following  metric 


S{r) 


npmin[l  +  (r/ro)^] 
nirr'^  —  rnJi(27rr) 


(35) 


We  now  check,  numerically,  that  the  function  E{r)  given  in  (35) 
is  a  convex  function  of  r,  for  ro(n)  <  r  <  0.5,  where  ro{n)  ~ 
r+(n),  to  ensure  connectivity. 
Let  us  rewrite  (35)  as 


f(r)  = 


with  g{r)  =  npnim[l  +  (^/'^o)^]  and  A2(r)  as  in  (30).  The  first- 
and  second-order  derivatives  of  g{r)  are,  respectively 

/,  ,      dg(r)  r"^"^ 
9  in  =  —7—  =  npminV — :-  >  0 


g  (r) 


dr 


To'' 


nPminViV  -  1)   >  0 


^For  simplicity  we  drop  in  A  2  the  dependence  on  L. 


SO  that  ^(r)  is  a  convex  increasing  positive  function  of  r.  Let  us 
now  study  the  behavior  of  A2(r).  Using 

d[r^Jm{r)] 


dr 


=  r'^J^_^{r)    for    m  =  0,1,2,... 


we  obtain 


X^{r)  =  =  27rrn[l  -  Jo(2^r)] 

with  A2(r)  >  0  for  ro(n)  <  r  <  0.5.  Moreover,  since 

— :j  = —Jm{r)  -  Jm-\-i{r)     for    m  =  0,1,2,... 

dr  r 

we  have 

A2  (r)  =  ^^^^  =  27rn[l  -  Jo(27rr)]  +  47rVnJi(27rr). 

(36) 

Observe  that  the  first  term  on  the  right-hand  side  (RHS)  of  (36) 
is  always  positive.  Furthermore,  if  r  <  1/2,  the  second  term  is 
also  positive,  since  Ji(27rr)  >  0  for  r  <  1/2.  Hence,  A2  (r)  >  0 
and  we  can  conclude  that  the  algebraic  connectivity  is  an  in- 
creasing and  convex  function  of  r  for  ro(n)  <  r  <  0.5,  where 
ro(n)  ~  r+(n). 

We  can  now  compute  the  first-  and  second-order  derivatives 
of  the  energy  function  with  respect  to  r.  We  get 


£{r) 


d'jr)  _  X'2{r)g{r) 
Hr)  Ai(r) 


and,  substituting  the  corresponding  expressions,  the  extremal 
points  can  be  obtained  by  solving  the  following  nonlinear 
equation: 

E{r)  =  ai(r)  +  a2(r)  Jo(27rr)  -  a3(r)  Ji(27rr)  =  0 

with  ai(r)  =  7r[{r]  -  2)r^  -  2r^],  a2(r)  =  27r(r^  +  ro^)  and 
a^(r)  =  r]r^~^.  Furthermore,  the  second  derivative  of  E{r)  is 
given  by  the  equation  shown  at  the  bottom  of  the  page.  In  Fig.  9, 
we  report  S  (r)  as  a  function  of  r,  for  different  values  of  r]. 
From  Fig.  9,  we  can  check  that  the  second  derivative  is  always 
positive  in  the  range  of  interest.  This  verifies  that  E{r)  is  in- 
deed a  convex  function  of  r  defined  on  the  compact  convex  set 
^o(^)  <  <  0-5.  As  a  consequence,  we  can  state  that  there 
is  always  at  least  a  radius  r  that  globally  minimizes  the  energy 
consumption  in  a  RGG. 

Numerical  examples.  In  Fig.  10,  we  compare  the  value  of  S{r) 
obtained  by  our  theoretical  approach  and  by  simulation,  for  var- 
ious values  of  the  path  loss  exponent,  2  <  7^  <  6.  The  results 
are  averaged  over  100  independent  realizations  of  random  geo- 
metric graphs  composed  of  n  =  1000  nodes.  For  each  r],  the  pair 
of  radius  and  energy  providing  minimum  energy  consumption  is 
indicated  by  a  circle  (simulation)  or  a  star  (theory).  We  observe 


£"ir) 


(/(r)A2(r)  -  2X',{r)g'{r))X2{r)  +  g{r){2X',  (r)  -  A;'(r)A2(r)) 

Ai(r) 
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Fig.  9.  Second  derivative  of  the  energy  versus  the  transmission  radius  for  sev- 
eral values  of  ij. 


Fig.  11.  Optimal  transmission  radius  versus  /y. 
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Fig.  10.  Global  energy  consumption  versus  transmission  radius  for  an  RGG; 
theoretical  values  (solid)  and  simulation  (dashed). 


that  the  theoretical  derivations  provide  a  very  good  prediction 
of  the  performance  achieved  by  simulation.  Furthermore,  for 
each  T],  there  is  a  coverage  radius  value  that  minimizes  energy 
consumption. 

The  optimal  values  of  the  coverage  radius  as  a  function  of  r], 
as  predicted  by  our  theoretical  derivations  or  by  simulation,  are 
reported  in  Fig.  11.  From  this  figure,  we  observe  that  there  is 
a  clear  transition  from  the  low  power  attenuation  regime  (i.e., 
f]  <  3.5),  where  the  optimal  radius  tends  to  make  the  network 
fully  connected,  as  opposed  to  the  strong  attenuation  situation 
{t]  >  4.5),  where  the  network  is  minimally  connected.  This 
means  that  in  the  low  attenuation  case,  minimizing  convergence 
time  is  more  important  than  minimizing  power  consumption. 


Fig.  12.  Global  energy  consumption  versus  per  node  transmit  power  for  an 
RGG;  theoretical  values  (solid)  and  simulation  (dashed). 


Conversely,  in  the  large  attenuation  case,  it  is  more  important  to 
minimize  power  consumption,  by  limiting  the  number  of  links 
as  much  as  possible,  compatibly  with  the  constraint  of  ensuring 
connectivity. 

Fig.  12  shows  the  average  energy  consumption  versus  the  per 
node  transmit  power  p,  under  the  same  settings  of  Fig.  10.  The 
circles  (simulation)  and  the  stars  (theory)  represent,  again,  the 
values  of  p  that  minimize  energy  consumption,  for  each  r].  It 
can  verified  the  existence  of  an  optimal  transmit  power  value 
minimizing  the  energy  consumption. 

Finally,  Fig.  13  shows  the  optimal  per  node  power  versus  rj. 
For  low  values  of  rj,  the  transmit  power  increases  with  r]  because 
it  must  cope  with  higher  attenuations  to  guarantee  connectivity. 
Conversely,  for  large  values  of  r],  the  optimal  power  decreases 
with  T]  because,  in  such  a  case,  it  is  more  beneficial  to  limit 
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Fig.  13.  Optimal  per  node  transmit  power  versus  77. 


Fig.  14.  Network  algebraic  connectivity  versus  per  node  transmit  power  for 
several  values  of  the  path  loss  coefficient. 


the  number  of  links,  until  the  minimum  power  guaranteeing  the 
connectivity  constraint  is  reached. 

C  Random  Topologies  versus  Uniformly  Spaced  Grids 

Finally,  it  is  interesting  to  compare  the  energy  consumption 
achievable  with  a  random  geometric  graph  and  a  deterministic 
grid.  In  the  deterministic  case,  we  may  also  distinguish  between 
a  graph  with  nodes  scattered  arbitrarily  in  a  given  area,  but  with 
positions  known,  and  a  regular  uniform  grid,  where  the  points 
are  located  over  a  rectangular  grid. 

1 )  Eigenvalues  of  a  Planar  Uniform  Grid:  We  start  by  de- 
riving the  algebraic  connectivity  of  a  square  grid  whose  n  nodes 
are  uniformly  spaced  within  a  unit  square,  at  a  distance  S  = 
l/m  =  l/\/n-  To  avoid  undesired  border  effects,  we  consider 
the  wrapping  of  the  unit  square  in  order  to  form  a  toroidal  sur- 
face. Each  node  is  assumed  to  have  a  link  with  the  neighboring 
nodes  only  if  they  are  at  a  distance  less  than  a  coverage  radius 
r{p),  that  depends  on  the  transmit  power  p,  as  with  geometric 
graphs.  In  Appendix  A,  we  derive  a  closed  form  expression  for 
the  network  degree  and  for  the  algebraic  connectivity  [see,  e.g., 
(45)-(48)]. 

A  numerical  check  of  our  derivations  is  reported  in  Fig.  14, 
where  we  show  the  numerical  value  of  A2  (L) ,  obtained  through 
the  eigendecomposition  of  L,  and  the  value  given  in  (48),  for  dif- 
ferent values  of  r].  From  Fig.  14,  we  can  see  a  perfect  agreement 
between  our  closed  form  expression  and  the  numerical  results. 
Notice,  in  particular,  the  sharp  transition  behavior:  the  eigen- 
values tend  to  n  or  0  depending  on  whether  the  transmit  power 
is  above  or  below  a  threshold;  further,  the  threshold  increases 
with  77. 

In  Fig.  15,  we  compare  the  algebraic  connectivity  of  the  rect- 
angular grid,  given  by  (48),  with  the  theoretical  value  obtained 
for  the  random  geometric  graph,  given  in  (30),  assuming  the 
same  node  density,  over  the  same  toroidal  surface.  As  shown  in 
previous  works,  see,  e.g.,  [4],  RGGs  tend  to  behave  asymptoti- 
cally as  a  regular  graph.  The  result  shown  in  Fig.  15  is  a  further 
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Fig.  15.  Algebraic  connectivity  of  random  geometric  graph  and  rectangular 
grid  versus  transmission  radius. 


confirmation  of  this  property.  In  the  same  figure,  we  also  re- 
port simulation  results  for  the  algebraic  connectivity,  obtained 
by  averaging  over  100  independent  realizations  of  a  RGG.  We 
can  check,  once  again,  the  good  agreement  between  theory  and 
simulation. 

Finally,  in  Fig.  16  we  compare  the  energy  consumption  ob- 
tained assuming  full  a  priori  knowledge  of  the  nodes'  locations 
or  no  knowledge  at  all.  In  the  first  case,  the  topology  and  the 
power  allocation  over  each  link  are  optimized  according  to  the 
method  illustrated  in  Section  IV.  The  optimal  values,  for  each 
T],  are  indicated  by  colored  dots.  In  the  second  case,  we  report 
(solid  line)  the  energy  consumption  versus  the  average  power, 
assuming  that  all  nodes  transmit  with  the  same  power  (since  they 
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Fig.  16.  Average  network  energy  consumption  versus  average  network  power. 


do  not  have  any  prior  information  about  the  other  nodes  posi- 
tions). To  make  a  fair  comparison  between  the  two  cases,  we 
consider  in  both  cases  the  sum  of  the  powers  associated  with 
each  active  Hnk.  This  means  that,  in  the  second  case,  the  energy 
consumption  is  measured,  for  every  common  transmit  power  p, 

as  E{p)  =  — '  where  Ui  (p)  denotes  the  number  of  neigh- 

bors of  node  i.  Tlie  results  shown  in  Fig.  16  have  been  obtained 
by  averaging  the  results  obtained  in  the  two  settings  over  the 
same  set  of  100  independent  random  deployments  of  n  =  30 
nodes,  uniformly  distributed  over  the  unit  square. 

From  Fig.  16  we  can  see  that,  as  expected,  the  method  as- 
suming perfect  a  priori  knowledge  of  the  node  positions  (in  the 
figure  denoted  as  is  indeed  able  to  achieve  better  per- 

formance than  the  other  method  {S{p)),  as  it  is  able  to  take  ad- 
vantage of  the  a  priori  knowledge.  The  optimal  average  power 
is  the  result  of  a  tradeoff  between  transmit  power  and  conver- 
gence time,  and  always  depends  on  the  path  loss  exponent. 


VI.  Conclusion 

In  this  paper  we  have  addressed  the  problem  of  finding  the 
network  topology  that  minimizes  the  energy  consumption  nec- 
essary to  achieve  consensus  in  a  WSN.  Assuming  a  simple  flat 
fading  propagation  model,  we  have  studied  two  main  network 
models:  1)  arbitrary  networks,  where  the  nodes  are  arbitrarily  lo- 
cated, but  with  known  positions,  and  2)  a  random  model,  where 
the  positions  are  not  known  a  priori,  but  are  modeled  as  random 
variables.  In  the  first  case,  we  have  shown  how  to  optimize  the 
network  topology  and  the  power  allocated  to  each  active  link 
in  order  to  minimize  the  total  energy  necessary  to  achieve  con- 
sensus within  a  prescribed  accuracy.  Topology  optimization  is, 
in  general,  a  combinatorial  problem  and  hence  computationally 
demanding.  To  simplify  the  solution,  we  introduced  a  relaxation 
step  that  enabled  us  to  reformulate  the  energy  minimization 


problem  as  a  convex-concave  fractional  program.  This  alterna- 
tive formulation  can  be  cast  as  an  equivalent  parametric  convex 
problem,  which  enables  efficient  solutions.  The  link  weights  re- 
sulting from  the  solution  of  the  convex  problem  must  then  be 
thresholded  to  find  out  the  network  topology.  We  have  shown 
through  numerical  results  that  the  thresholding  operation  may 
yield  a  considerable  reduction  of  the  number  of  active  links,  yet 
with  very  limited  effect  on  performance. 

In  our  deterministic  setting,  we  assumed  point-to-point  links 
and  we  optimized  the  power  over  each  link.  In  practice,  a  con- 
sensus algorithm  running  over  a  wireless  network  could  ben- 
efit from  the  broadcast  channel.  Deciding  between  broadcast  or 
one-to-one  links  entails  a  proper  choice  of  the  medium  access 
strategy,  to  establish  in  which  time  slot  each  node  has  to  listen 
to  which  broadcaster. 

Conversely,  in  the  random  network  case,  lacking  any  infor- 
mation about  the  intemode  distances,  we  assumed  a  broadcast 
communication  strategy.  In  such  a  case,  the  network  topology 
is  modeled  as  a  random  geometric  graph.  We  have  derived 
closed  form  expressions,  albeit  valid  only  asymptotically,  for 
the  algebraic  connectivity,  assuming  a  common  transmit  power. 
Then,  building  on  these  expressions,  we  have  shown  that  the 
energy  consumption  is  a  convex  function  of  the  coverage  radius. 
We  have  also  shown  that  a  random  geometric  graph  performs, 
asymptotically,  as  a  regular  graph  built  over  a  rectangular  grid. 
This  confirms  previous  results,  although  now  in  the  context  of 
energy  minimization  over  consensus  networks. 

Finally,  we  have  compared  the  performance  achievable  with 
arbitrary  and  random  graphs.  Clearly,  the  knowledge  of  the 
node  locations  allows  better  power  allocation,  that  translates 
into  lower  energy  consumption  to  achieve  consensus.  However, 
in  practice  there  is  a  price  associated  with  the  knowledge  of 
node  location.  This  knowledge  requires  the  acquisition  of 
the  node  positions  first  and  then  a  centralized  optimization. 
Conversely,  the  random  approach  can  be  followed  also  in  a 
decentralized  fashion,  with  only  minimal  information  about 
some  global  parameters  like  number  of  nodes  and  area  covered 
by  the  network,  and  it  does  not  need  any  extra  hardware  or 
computation  to  acquire  the  nodes'  locations. 

In  this  paper,  we  have  assumed  a  simple  flat  fading  channel 
model,  whose  effect  is  only  to  introduce  attenuation  and  su- 
perimpose noise.  The  simple  model  captures  the  essence  of  the 
problem  and  keeps  the  overall  problem  complexity  under  con- 
trol. However,  looking  at  potential  applications,  it  would  be 
interesting  to  generalize  the  approach  to  the  case  where  the 
channel  model  is  more  complicated.  Furthermore,  the  network 
topology  has  been  assumed  to  be  static.  However,  a  dynamic 
topology,  possibly  adapted  to  the  consensus  state,  might  pro- 
vide better  performance. 


Appendix  A 

In  this  section,  we  derive  an  analytical  expression  for  the 
eigenvalues  of  the  adjacency  matrix  ^  of  a  rectangular  grid  over 
a  unit  torus,  as  a  function  of  the  coverage  radius  r{p).  We  as- 
sume, for  simplicity  that  the  number  of  nodes  n  is  a  square 
number,  i.e.,  n  =  nf,  with  ni  integer.  We  number  the  rows  of 


36 


the  grid,  1  to  ni,  going  from  bottom  to  top.  The  nxn  matrix  A 
can  be  expressed  in  a  cycHc  block  form,  as 


-  Ao 

^1 

A2  . 

-2 

-1 

— 

1 

A,  . 

-3 

-2 
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Ao  . 

-4 

-3 

A, 


Ao 


where  each  block  Ai,  for  i  =  0, . . . ,  ni  —  1,  is  an  ni  x  ni 
circulant  matrix.  In  particular,  the  block  is  the  adjacency 
matrix  between  the  nodes  lying  on  the  same  row  of  the  grid, 
starting  from  the  leftmost  node  and  proceeding  along  the  row. 
Ai  represents  the  adjacency  matrix  between  the  nodes  of  row  j 
and  the  nodes  lying  on  the  {i  +  j)th  row,  for  i  =  1, . . . ,  ni  —  j, 
and  on  the  {i  +  j  —  ni)th  row  for  i  =  ni  —  j  +  1, . . . ,  ni  —  1. 
The  toroidal  structure,  used  to  avoid  border  effects,  reflects  into 
the  cyclic  structure  of  A,  so  that  we  have  A^^-i  =  Ai  if 

i  =  (m  —  l)/2  +  1, . . . ,  ni  —  1    and    ni  odd 
z  =  ni/2  +  1, . . . ,  ni  —  1    and    ni  even. 

We  also  note  that  ^  is  a  block  circulant  matrix  with  circulant 
blocks  (BCCB).  Since  each  block  for  z  =  0, . . . ,  ni  —  1  is 
an  ni  X  ni -dimensional  circulant  matrix,  the  eigenvalues  of  Ai 
admit  the  following  decomposition 


with 
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1  UJ' 
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where  uj  =  exp  {—j27r/ni);  Ai  is  a  diagonal  matrix  whose  en- 
tries are  the  eigenvalues  of  Ai,  i.e.,  the  discrete  Fourier  trans- 
form (DFT)  of  the  first  row  of  Ai .  Hence,  the  matrix  A  admits 
the  following  diagonalization: 

where  (g)  denotes  the  Kronecker  product  and 

ni  —  l 


k 


k=0 


with  ftn^  =  diag[l,  cj, . . . ,  uj^^~^].  The  entries  A;^^(^)  of  the 
matrix  A  can  be  also  expressed,  after  a  few  algebraic  manipu- 
lations, as 


n\—l n\—l 


i=0  k=0 

X  exp  {—j27rkl/ni)  exp  (—j27rmi/ni)  (37) 


for  /,  m  =  0, . . . ,  ni  —  1  and  the  coefficient  a^*^  (p)  is  A;th  entry 
of  the  first  row  of  for  z  =  0, . . . ,  ni  —  1.  Equivalently,  (37) 
can  be  seen  as  the  two-dimensional  DFT  of  a  matrix  A  whose 
rows  A{i^ :)  are  given  by  A{i^ :)  =  ^^(1, :)  for  z  =  0, . . . ,  ni  — 
1,  applying  the  1-DFT  to  the  rows  and  then  to  the  columns  of  A. 

In  general,  for  a  regular  grid  where  each  node  has  degree  d{p) , 
the  algebraic  connectivity  can  be  expressed  as 


A2(L(p))  =  d{rip)) 


ni  —  1  ni—1 

-EE  4'^(p)exp(-j27rfc/ni)  (38) 

i=0  k=0 

where  the  coefficients  ci^^\p)  can  be  expressed  as  follows.  Let 
u{x)  denote  the  step  function,  i.e.,  u{x)  =  1  for  x  >  0  and  zero 
otherwise.  For  ni  odd,  we  have 


(39) 


aQ^\p)  =  0  and 

a^o\p)  =  <r{p)  -iS)     i  =  1, . . . ,  (m  -  l)/2, 
a^^\p)  =  u{r{p)  —  kS)  and 

4^)(p)  =  u{r{p)  -  yji^  +  kH)        ^  =  1, . . . ,  (m  -  l)/2 

(40) 

^f^V)  =  cif^-k{v)     k  =  {m-  l)/2  +  1, . . .  ,ni  -  1  and 
^  =  0,...,(ni-l)/2.  (41) 

Finally  Ai  =  An^-i  for  i  =  (ni  —  l)/2  +  1, . . . ,  ni  —  1. 
Similarly,  in  the  case  of  ni  even,  we  have 


(42) 


aQ^\p)  =  0  and 

a^o\p)  =  u{r{p)  -i8)  z  =  1, . . .  ,ni/2, 

a^^\p)  =  u{r{p)  —  kS)  and 

a^\p)  =  u{r{p)  -  +  k^  6)    i,k  =  l,...,  ni/2,  (43) 

a^k\p)  =  ^nl-k(p)     k  =         +  1, . . . , ni  -  1  and 

z  =  0,...,ni/2  (44) 

with  Ai  =  An^-i  if  i  =  ni/2  +  1, . . . ,  ni  —  1. 

Finally,  we  can  derive  analytical  expressions  of  the  network 
degree  and  of  the  algebraic  connectivity  for  ni  odd  as 
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and 

(ni-l)/2 

X2{L{p))  =     Yl    ^^k\p)i^  -  cos(2^A;/ni)) 
k=i 

(ni-l)/2  (ni-l)/2 

.(0 


+  4    E       E  <^b)(l-cos(2^A;/ni)) 

(46) 


^=l  fc=l 


37 


and  for  ni  even 
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x(l-cos(27rfc/ni))+4   ^  a^/^Cp) 

ni/2-1  ni/2-1 

-4   ^     ^   4^^(p)cos(2^A;/ni).  (48) 
i=i  k=i 

For  example,  in  the  case  where  r{p)  <  i.e.,  assuming  a 
regular  grid  of  degree  d{p)  =  4,  we  obtain 

^^,_i=Ai=/    andAi  =  OV2  =  2,...,ni-2  (49) 

while  ^0  is  a  circulant  matrix  whose  first  row  is  the  ni -di- 
mensional vector  [0100 ...  001].  Then,  the  eigenvalues 
are  given  by  Xi{Ao)  =  2 cos(27r//ni)  and  applying  the 
formula  in  (37)  the  eigenvalues  of  A  are  Xi^rn{A)  = 
2cos(27r//ni)  +  2  cos(27rm/ni)  for  /,m  =  0,  ...,ni  —  1. 
The  second  largest  eigenvalue  of  A  is  obtained  for  /  =  1  and 
m  =  0  and  is  given  by  Ai,o(^)  =  2cos(27r/ni)  -|-  2.  Thus 
the  eigenvalues  of  the  Laplacian  matrix  can  be  expressed  as 
Xi(L)  =  4  —  Xi{A)  and  the  algebraic  connectivity  of  the 
network  is  given  by  X2{L)  =  2  —  2cos(27r/ni). 
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Abstract.  The  availability  of  both  global  and  regional  el- 
evation datasets  acquired  by  modern  remote  sensing  tech- 
nologies provides  an  opportunity  to  significantly  improve  the 
accuracy  of  stream  mapping,  especially  in  remote,  hard  to 
reach  regions.  Stream  extraction  from  digital  elevation  mod- 
els (DEMs)  is  based  on  computation  of  flow  accumulation, 
a  summary  parameter  that  poses  performance  and  accuracy 
challenges  when  applied  to  large,  noisy  DEMs  generated 
by  remote  sensing  technologies.  Robust  handling  of  DEM 
depressions  is  essential  for  reliable  extraction  of  connected 
drainage  networks  from  this  type  of  data.  The  least-cost  flow 
routing  method  implemented  in  GRASS  GIS  as  the  mod- 
ule r.watershed  was  redesigned  to  significantly  improve  its 
speed,  functionality,  and  memory  requirements  and  make  it 
an  efficient  tool  for  stream  mapping  and  watershed  analysis 
from  large  DEMs.  To  evaluate  its  handling  of  large  depres- 
sions, typical  for  remote  sensing  derived  DEMs,  three  differ- 
ent methods  were  compared:  traditional  sink  filling,  impact 
reduction  approach,  and  least-cost  path  search.  The  compari- 
son was  performed  using  the  Shuttle  Radar  Topographic  Mis- 
sion (SRTM)  and  Interferometric  Synthetic  Aperture  Radar 
for  Elevation  (IF S ARE)  datasets  covering  central  Panama  at 
90  m  and  10  m  resolutions,  respectively.  The  accuracy  as- 
sessment was  based  on  ground  control  points  acquired  by 
GPS  and  reference  points  digitized  from  Landsat  imagery 
along  segments  of  selected  Panamanian  rivers.  The  results 
demonstrate  that  the  new  implementation  of  the  least-cost 
path  method  is  significantly  faster  than  the  original  version, 
can  cope  with  massive  datasets,  and  provides  the  most  ac- 
curate results  in  terms  of  stream  locations  validated  against 
reference  points. 
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(markus.metz@uni-ulm.de) 


1  Introduction 

Shuttle  Radar  Topographic  Mission  (SRTM;  Farr  et  al., 
2007)  and  various  airborne  Interferometric  Synthetic  Aper- 
ture Radar  for  Elevation  (IF  S ARE)  surveys  provide  a  new 
generation  of  elevation  data  in  regions  that  have  had  only 
limited,  often  low  resolution  coverage.  These  topographic 
data  sets  are  increasingly  used  to  improve  mapping  of  geo- 
morphic  and  hydrologic  features,  especially  in  remote,  hard 
to  reach  areas  and  at  regional  to  global  scales  (e.g.,  Kinner 
et  al.,  2005;  Lehner  and  Doll,  2004;  World  Wildlife  Fund, 
2009). 

In  spite  of  significant  advances  in  the  development  of  flow 
routing  algorithms  (e.g.  Quinn  et  al.,  1991;  Costa-Cabral  and 
Surges,  1994;  Holmgren,  1994;  Quinn  et  al.,  1995;  Tarboton, 
1997),  accurate  extraction  of  drainage  networks  from  Digi- 
tal Elevation  Models  (DEMs)  generated  by  remote  sensing 
technologies  such  as  SRTM  or  IFSARE  remains  challeng- 
ing. To  fiilly  understand  the  problem,  it  is  important  to  real- 
ize that  the  mapped  elevation  surfaces  include  the  top  of  the 
forest  canopy,  as  well  as  anthropogenic  features,  rather  than 
the  ground  surface  required  for  flow  routing.  Although  ho- 
mogeneous forest  canopy  generally  follows  the  shape  of  the 
ground  surface,  gaps  in  vegetation  that  can  stretch  over  hun- 
dreds of  meters  create  large,  often  nested,  depressions  that 
pose  difficulties  for  flow  routing.  In  addition  to  these  large 
depressions,  the  surface  over  forested  areas  is  noisy,  creating 
numerous  small  depressions  and  barriers  that  fiirther  com- 
plicate drainage  network  extraction.  Therefore,  one  of  the 
important  questions  investigated  in  this  paper  was  whether 
such  data  are  suitable  for  drainage  network  extraction  at  all 
and  if  yes,  how  accurate  are  the  extracted  drainage  networks 
and  which  methods  provide  the  most  reliable  results. 

At  the  same  time,  the  broad  availability  of  elevation  data 
dramatically  increased  the  extent  of  regions  that  can  be  ana- 
lyzed at  relatively  high  resolutions,  given  the  computational 
capabilities  that  can  support  processing  of  massive  DEMs. 
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Therefore,  significant  effort  has  been  devoted  to  the  devel- 
opment of  new  flow  tracing  and  watershed  analysis  algo- 
rithms that  support  efficient  processing  of  large  DEMs  and 
address  the  issue  of  routing  through  complex  nested  depres- 
sions (e.g.  Rivix  Limited  Liability  Company,  2001;  Arge  et 
al.,  2003;  Banner  et  al.,  2007). 

The  most  widespread  method  for  handling  depressions  is 
sink  filling,  up  to  the  level  of  the  sink  spill  point,  com- 
bined with  routing  through  the  resulting  flat  area  (Jenson  and 
Domingue,  1988;  Fairfield  and  Leymarie,  1991;  Planchon 
and  Darboux,  2001;  Wang  and  Liu,  2006).  Improved  sink 
filling  methods  (e.g.  Garbrecht  and  Martz,  1997;  Grimaldi 
et  al.,  2007;  Santini  et  al.,  2009)  first  fill  sinks,  and  then  in- 
troduce a  gradient  to  all  flat  areas  to  provide  non-zero  gra- 
dients for  flow  routing.  The  method  complementary  to  sink 
filling  is  carving  or  breaching  (Rieger,  1998;  Martz  and  Gar- 
brecht, 1998)  where  a  channel  is  carved  out  of  each  sink, 
breaking  through  the  (artificial)  obstacle.  Both  principles  can 
be  combined  in  an  impact  reduction  approach  (IRA;  Lindsay 
and  Creed,  2005)  that  for  each  sink  determines  the  method 
that  causes  the  least  impact  on  the  source  dataset.  All  these 
approaches  alter  the  elevation  data  in  order  to  ensure  full 
drainage,  assuming  that  sinks  are  artifacts  created  either  by 
too  low  (artificial  pits)  or  too  high  (artificial  drainage  blocks) 
elevation  values. 

An  alternative  to  the  modification  of  elevation  data  is  to 
determine  the  least-cost  drainage  paths  (LCP)  through  un- 
altered terrain  and  out  of  sinks.  LCP  search  methods  were 
generally  designed  to  find  the  shortest  or  fastest  route  from  a 
starting  point  to  a  given  destination,  used  for  example  in  car 
navigation  devices  or  to  generate  cumulative  cost  surfaces.  A 
particular  LCP  search  method  (A*  Search;  Hart  et  al.,  1968) 
was  adapted  for  flow  routing  and  watershed  analysis  and  im- 
plemented as  the  module  r.watershed  (Ehlschlaeger,  1989)  in 
GRASS  GIS  (Neteler  and  Mitasova,  2008,  see  GRASS  De- 
velopment Team  2010  for  binaries  and  source  code).  When 
applied  to  IFSARE  derived  DEMs,  this  approach  has  pro- 
vided more  accurate  flow  routing  through  large,  nested  de- 
pressions with  fewer  artifacts  than  traditional  sink  filling 
(Kinner  et  al.,  2005).  However,  the  module  was  not  de- 
signed to  handle  the  massive  DEMs  that  are  currently  avail- 
able (Arge  et  al.,  2003).  To  render  the  module  suitable  for 
analysis  of  very  large  datasets  at  ever  higher  resolutions,  and 
to  reduce  the  artifacts  in  flow  patterns  due  to  the  single  flow 
direction  (SFD)  algorithm  (Quinn  et  al.,  1991;  Holmgren, 
1994;  Tarboton,  1997),  redesign  of  the  implementation  and 
addition  of  more  flow  routing  options  was  necessary. 

In  this  study,  we  present  substantial  improvements  to  the 
LCP  method  implementation  and  evaluation  of  its  efiiciency 
and  accuracy  when  applied  to  radar-based  DEMs.  We  com- 
pare the  performance  of  the  improved  module  with  (1)  its 
original  implementation,  (2)  a  module  based  on  a  disk  in- 
put/output (I/O)  efiicient  method  (Arge  et  al.,  2003),  and 
(3)  the  IRA  implementation  in  the  Terrain  Analysis  System 
(TAS,  Lindsay  and  Creed,  2005).  We  also  analyze  the  impact 


of  mapping  technology  (IFSARE,  SRTM),  resolution,  and 
DEM  resampling  on  the  accuracy  of  the  extracted  drainage 
networks.  Although  the  LCP  method  has  been  used  since 
early  90ies,  the  literature  that  describes  the  algorithm,  its 
implementation  and  properties  is  very  limited  (Ehlschlaeger, 
1989).  This  paper  aims  to  fill  this  gap  in  literature  and  to 
highlight  the  value  of  this  method  and  its  improvements  for 
analysis  of  modem  DEMs. 


2  Methods 

2.1    Improved  least  cost  path  search  algorithm 

The  LCP  algorithm  (A*  Search,  Hart  et  al.,  1968; 
Ehlschlaeger,  1989)  used  for  flow  routing,  flow  accumula- 
tion, and  watershed  analysis  of  raster-based  DEMs  was  re- 
designed to  increase  the  processing  speed  and  decrease  mem- 
ory consumption.  The  implementation  in  the  GRASS  mod- 
ule r.watershed  (Fig.  1)  starts  with  potential  outlet  points. 
Natural  ultimate  outlets  are  e.g.  river  mouths  opening  into 
oceans  or  lakes  without  outflow.  On  gridded  elevation  mod- 
els, potential  outlets  are  grid  cells  along  the  map  boundaries 
or  cells  with  at  least  one  neighbor  with  unknown  elevation, 
e.g.  masked  ocean.  All  potential  outlet  grid  points  are  in- 
serted into  a  list  sorted  by  costs.  Costs  are  measured  as  eleva- 
tion and  order  of  addition  to  the  sorted  list  (grid  cells  added 
earlier  have  higher  precedence  in  case  of  equal  elevation). 
For  the  actual  search,  the  grid  cell  with  the  lowest  eleva- 
tion (smallest  cost)  is  extracted  and  removed  from  the  sorted 
list,  marked  as  processed  and  its  neighbors  are  investigated. 
This  causes  the  search  to  proceed  along  the  least  steep  up- 
hill slope.  At  each  step  during  the  search,  only  neighboring 
grid  cells  that  are  not  yet  in  the  search  list  and  not  yet  pro- 
cessed are  added  to  the  list,  and  drainage  direction  for  these 
neighboring  grid  cells  is  set  towards  the  current  grid  cell.  If 
a  depression  is  encountered,  the  search  follows  the  steepest 
downhill  slope  to  the  bottom  of  a  depression  and  then  pro- 
ceeds again  along  the  least  steep  uphill  slope.  The  search 
proceeds  in  this  manner  upstream  along  the  least  steep  slope 
and  terminates  when  all  grid  points  have  been  processed.  The 
LCP  search  provides  two  results:  (a)  flow  direction  for  each 
cell  in  a  standard  D8  manner  (O'Callaghan  and  Mark,  1984), 
and  (b)  the  order  in  which  cells  must  be  processed  for  flow 
accumulation. 

The  search  component  of  the  original  implementation  was 
modified  by  augmenting  it  with  a  minimum  heap  used  as  pri- 
ority queue  (Atkinson  et  al.,  1986;  Metz  and  Ehlschlaeger, 
2010),  which  led  to  a  substantial  speed  improvement.  In 
the  original  implementation,  the  time  needed  to  keep  the 
sorted  list  sorted  increased  exponentially  with  the  number 
of  grid  cells,  while  in  the  new  implementation  this  time  in- 
crease is  logarithmic.  Removal  of  redundant  information  in 
intermediate  data  as  well  as  reduced  memory  requirements 
for  both  the  search  and  the  flow  accumulation  components 
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determine  ultimate  outlets 
and  add  them  to  the 
search  list 


fetch  point  with  lowest 
elevation  from  the  search 
list 


terminate  search 


j  search  list 
1  empty 


I  search  list  not  empty 


remove  current  point  from 
the  search  list  and  set  it  as 
new  start  point  for  flow 
accumulation 


add  unvisited  neighbors  to 
the  search  list  and  set 
initial  flow  direction  to 
current  point 


Fig.  1.  Flowchart  illustrating  the  least  cost  path  search  algorithm. 


have  further  enhanced  the  module's  capacity  to  process  larger 
DEMs.  For  massive  datasets  that  can  not  be  processed  with 
the  amount  of  memory  available,  the  module  can  optionally 
use  external  memory  with  intermediate  data  stored  on  disk. 
These  changes,  aimed  at  improving  the  computational  per- 
formance, did  not  affect  the  quality  of  the  results. 

To  improve  the  flow  routing  accuracy  and  to  reduce  arti- 
facts in  the  flow  accumulation  pattern,  a  multiple  flow  di- 
rection (MFD)  algorithm,  based  on  Holmgren  (1994),  was 
implemented  using  the  order  determined  by  the  LCP  search 
with  an  option  to  control  the  strength  of  flow  convergence. 
For  grid-based  elevation  models,  an  MFD  approach  con- 
forms better  to  theoretical  surface  flow  dispersion  than  single 
flow  direction  in  which  only  one  neighboring  cell  can  receive 
surface  flow  from  the  current  cell  (Freeman,  1991;  Quinn  et 
al.,  1991;  Holmgren,  1994;  Tarboton,  1997).  Initial  D8  flow 
directions  as  determined  by  the  LCP  search  are  used  to  route 
flow  out  of  depressions  when  all  unprocessed  neighboring 
grid  cells  have  higher  elevation  than  the  current  grid  cell. 


2.2    Elevation  data 

Two  different  sources  of  elevation  data  were  used  to  demon- 
strate the  improvements  in  the  LCP  method  (Fig.  2).  Coun- 
trywide elevation  coverage  of  Panama  was  available  as  a 
3  arc  second  resolution  SRTM  DEM  (Farr  et  al,  2007). 
SRTM  version  2.1  provided  by  the  NASA  Jet  Propul- 
sion Laboratory  (http://www2.jpl.nasa.gov/srtm/,  download 
at  http://dds.cr.usgs.gov/srtm/)  was  selected  for  this  study  as 
the  most  reliable  version  in  terms  of  accuracy  and  minimal 
artifacts,  after  evaluating  properties  of  the  SRTM  products 
available  at  the  time  of  writing  (vl,  v2.1,  v3,  v4.1).  The 
published  absolute  vertical  error  of  the  version  2.1  SRTM 
DEM  is  6.2  m  for  South  America  including  Panama  (Farr 
et  al.,  2007),  although  several  studies  report  higher  accura- 
cies (see  e.g.  Rodriguez  et  al.,  2006).  A  recent  airborne  IF- 
SARE  survey  has  provided  new,  more  detailed  information 
about  the  topography  in  central  Panama  at  10  m  resolution, 
using  technology  with  published  vertical  accuracy  of  ~3  m 
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Fig.  3.  Shaded  relief  of  a  Landsat  false  color  composite  (R,G,B  = 
4,5,3)  overlaid  with  GPS  reference  points  in  red  and  points  digitized 
from  Landsat  in  grey. 

(Andersen  et  al.,  2006).  IFSARE  (InterFerometric  Synthetic 
Aperture  Radar  -  Elevation)  is  derived  in  a  way  very  simi- 
lar to  SRTM,  with  the  main  difference  that  IFSARE  is  air- 
borne and  SRTM  data  was  recorded  with  sensors  onboard 
the  Space  Shuttle  Endeavour.  Both  IFSARE  and  SRTM  used 
two  antennas  to  collect  radar  data;  elevation  information  was 
derived  by  analyzing  the  data  recorded  by  the  two  antennas 
(the  general  concept  of  the  new  satellite -borne  TanDEM-X 
mission  is  similar,  collecting  radar  data  with  two  antennas). 
Neither  of  the  surveys  penetrated  the  forest  canopy  to  the 
ground  surface  and  the  elevation  surface  was  over  large  re- 
gions defined  by  a  triple  canopy  tropical  forest  environment 
with  tree  heights  of  more  than  30  m  above  ground.  Given 
the  widespread  use  of  the  SRTM  data,  one  of  the  important 
questions  explored  in  this  paper  was  whether  such  data  are 
suitable  for  stream  extraction  at  all,  and  if  yes,  what  is  the  ac- 
curacy and  which  methods  provide  the  most  reliable  results 
in  this  challenging  environment. 

SRTM  v2  tiles  covering  all  of  Panama  were  combined  and 
gaps  in  the  dataset  filled  using  the  regularized  spline  with  ten- 
sion (RST)  interpolation  method  (GRASS  module  r.fillnulls\ 
Mitasova  and  Mitas,  1993).  The  seamless  SRTM  coverage 
was  then  reprojected  with  bicubic  interpolation  from  geo- 
graphic to  the  UTM  zone  1 7N  coordinate  system  with  90  m 
resolution  to  keep  reprojection  modifications  to  a  minimum. 
To  evaluate  the  impact  of  resampling  on  stream  extraction 
accuracy  (see  e.g.  Valeriano  et  al.,  2006)  and  to  generate 
data  for  additional  performance  testing,  the  reprojected  90  m 
SRTM  DEM  was  reinterpolated  to  30  m  resolution  using  the 
RST  method. 


The  IFSARE  data  were  provided  as  a  10  m  resolution 
DEM  (Kinner  et  al.,  2005)  in  the  UTM  zone  17N  coordi- 
nate system  and  did  not  require  additional  processing.  For 
additional  testing  purposes,  the  IFSARE  DEM  was  down- 
sampled  to  30  m  resolution  using  bicubic  interpolation  in  or- 
der to  have  DEMs  from  two  sources  (SRTM  and  IFSARE) 
with  different  level  of  detail,  but  identical  resolution  of  30  m. 

2.3    Stream  location  data 

Two  sets  of  data  points  were  used  to  evaluate  horizontal  accu- 
racy of  the  drainage  network  extraction  methods:  (i)  stream 
segments  digitized  from  Landsat  imagery  (TM  5,  year  2000, 
scene  id  LT50120542000087XXX02,  provided  by  the  United 
States  Geological  Survey  -  USGS)  and  (ii)  GPS  field  mea- 
surements. Bands  3,  4,  and  5  of  the  Landsat  TM  scene  were 
used  at  its  native  resolution  of  30  m.  The  geographic  projec- 
tion for  this  study  (UTM  1 7  North)  was  identical  to  the  pro- 
jection  of  this  Landsat  scene,  i.e.  the  Landsat  grid  geometry 
and  values  were  not  modified.  A  Landsat  false  color  com- 
posite with  Red,  Green,  and  Blue  assigned  to  the  channels 
4,  5,  and  3  respectively,  clearly  separated  vegetation  from 
waterbodies  and  provided  the  background  for  manual  digiti- 
zation of  points  along  the  river  centerlines.  Only  streams  and 
rivers  with  a  width  less  or  equal  to  4  grid  cells  (120  m)  were 
digitized  in  order  to  exclude  lakes,  anabranching  or  braided 
rivers,  and  rivers  too  broad  to  reliably  determine  a  stream 
center  using  Landsat  imagery  (Fig.  3).  The  horizontal  ac- 
curacy of  these  manually  digitized  reference  points  is  given 
by  the  imagery  resolution  of  30  m  or  better,  depending  on 
the  stream  width  and  the  location  of  the  stream  centerline  in 
relation  to  the  grid  cells. 

GPS  points  were  collected  in  the  field  using  a  Corval- 
lis  Microtechnology  March  v3.7  GPS  unit  with  ~2m  posi- 
tional accuracy.  Sites  in  the  Chagres  river  watershed  were 
measured  during  the  years  2002  to  2007  and  sites  at  lower 
reaches  of  most  major  rivers  across  Panama  were  measured 
during  2005  and  2009  years.  GPS  points  were  collected 
along  clearly  identifiable  perennial  rivers  at  locations  where 
a  reliable  GPS  signal  was  available.  The  GPS  measurements 
included  a  larger  proportion  of  points  acquired  in  mountain- 
ous regions  along  smaller  rivers,  while  most  of  the  points 
located  along  larger  rivers  in  low-gradient  flood  plain  and 
coastal  plain  landscapes  were  digitized  from  Landsat  im- 
agery (Fig.  3). 

The  SRTM  water  body  database  was  not  used  for  the  ac- 
curacy evaluation  because  this  cannot  be  regarded  as  an  in- 
dependent data  source  since  it  was  used  in  the  creation  of  the 
SRTM  DEM  v2,  and  because  even  the  larger  rivers  visible  on 
Landsat  imagery  were  still  too  narrow  to  be  captured  in  the 
SRTM  water  body  database. 

We  have  also  considered  and  rejected  the  use  of  hydrog- 
raphy (blue  lines)  from  topographic  maps  because  of  their 
low  reliability  and  a  relatively  coarse  scale  of  1 : 50  000.  Sev- 
eral studies  that  have  compared  the  blue  lines  with  on-ground 
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Fig.  4.  Extraction  of  a  single  main  channel:  (A)  Location  of  areas  shown  in  images  (B),  (C),  and  (D);  (B)  The  starting  points  of  the  extracted 
streams  (red  lines)  were  well  upstream  of  the  reference  points  (black  crosses).  Flow  accumulation  derived  from  IFSARE  10  m  is  used  as 
background.  The  extracted  streams  (red  lines)  are  located  within  broader  stream  tubes  in  low-gradient  areas  close  to  the  coast,  for  both  (C) 
LCP-derived  streams  and  (D)  streams  derived  with  sink  filling.  IFSARE  10  m  was  used  for  both  (C)  and  (D)  representing  the  same  area. 


surveys  of  existing  streams  demonstrated  that  the  blue  lines 
were  the  least  accurate  source  of  stream  location  informa- 
tion (see  e.g.  Colson,  2006  and  the  references  therein),  of- 
ten based  on  old  data  digitized  from  aerial  photographs  with 
large  errors  in  locations  where  the  streams  were  not  visible, 
due  to  the  presence  of  vegetation.  In  fact,  one  of  the  major 
motivations  for  deriving  the  drainage  network  from  the  IF- 
SARE data  in  Panama  was  the  inadequate  scale  of  available 
blue  lines  with  many  larger  streams  missing  and  with  errors 
in  the  mapped  streams  location.  Until  recently,  the  relative 
inaccessibility  of  some  regions  (e.g.  the  upper  Chagres  river) 
and  an  incessant  cloud  cover  have  discouraged  mapping  in 
these  areas  (Kinner  et  al.,  2005). 

2.4    Flow  accumulation  and  evaluation  of  extracted 
drainage  networks 

Several  tests  were  performed  to  evaluate  the  tested  flow  rout- 
ing algorithms.  The  first  test  evaluated  the  processing  speed 
and  capability  to  handle  large  DEMs  when  computing  flow 
accumulation  maps.  In  this  test,  the  performance  of  the  new 
LCP  implementation  was  compared  with  the  old  r.watershed 
module  (GRASS  Development  Team,  2008)  and  with  a  rel- 
atively new,  I/O-eflficient  algorithm,  used  in  the  module 


r.terraflow  (Arge  et  al.,  2003).  All  computations  were  done 
on  the  same  hardware  (AMD  Athlon  X2  3  GHz  and  8  GB 
RAM)  but  different  operating  systems  (Linux  64  bit  for  the 
GRASS  modules  r.watershed  and  rterraflow,  and  Windows 
XP  32  bit  for  TAS  GIS  which  is  a  32  bit  application  for  Mi- 
crosoft Windows  and  was  not  available  as  64  bit  application 
at  the  time  of  writing). 

The  second  test  evaluated  the  horizontal  accuracy  of  the 
extracted  drainage  paths  for  algorithms  with  different  han- 
dling of  depressions.  Three  approaches  were  tested:  (i)  sink 
filling  as  implemented  in  rterraflow,  (ii)  the  impact  reduction 
approach  (IRA)  as  implemented  in  TAS  GIS  (Lindsay  and 
Creed,  2005),  and  (iii)  the  LCP  algorithm  as  implemented  in 
the  enhanced  version  of  r.watershed  in  GRASS  GIS  6.4.  To 
test  the  sink  filling  method,  rterraflow  was  used  because  of 
its  capability  to  efiiciently  process  massive  DEMs. 

The  third  test  evaluated  the  impact  of  DEM  resolution 
and  level  of  detail  on  the  horizontal  accuracy  of  the  ex- 
tracted drainage  paths  for  all  tested  algorithms  (r.watershed, 
rterraflow,  and  TAS). 

The  tests  were  performed  for  the  central  Panama  region 
using  both  the  IFSARE  and  SRTM  data  at  the  10  m,  30  m 
and  90  m  resolutions  by  computing  flow  accumulation  with 
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MFD  dispersal  and  extracting  drainage  networks  from  the 
computed  flow  accumulation. 

Drainage  networks  were  extracted  from  all  MFD  flow  ac- 
cumulation maps  using  a  minimum  upstream  catchment  area 
of  100  000  m^  as  a  threshold.  This  threshold  was  selected 
for  testing  purposes  and  to  ensure  that  all  measured  or  dig- 
itized points  were  located  downstream  from  the  extraction 
starting  point  and  that  all  points  had  a  drainage  path  associ- 
ated with  it  (Fig.  4b).  Accurate  identification  of  the  channel 
head  zones  was  beyond  the  scope  of  this  paper  because  it  re- 
quires more  than  elevation  data  as  it  is  often  significantly  in- 
fluenced by  local  geology,  groundwater  level,  and  land  cover 
(North  Carolina  Division  of  Water  Quality  NCWQ,  2010). 
According  to  the  NCWQ  (2010)  study,  "the  stream  origins 
usually  occur  as  transition  zones  in  which  the  location  and 
length  of  the  zone  is  subject  to  fluctuations  in  groundwater 
levels  and  precipitation".  The  tested  90  m  and  30  m  resolu- 
tion SRTM  DEMs  measured  over  at  least  30  m  high  canopy 
of  dense  tropical  forest  did  not  provide  sufficient  informa- 
tion for  accurate  identification  and  mapping  of  the  dynamic 
channel  heads  zones.  Therefore  our  focus  was  on  mapping 
streams  and  rivers  that  have  sufficiently  large  contributing 
areas  that  the  existence  of  a  well  defined  stream  can  be  ex- 
pected. 

In  order  to  extract  a  single  main  channel  in  broader  flood- 
plains  where  MFD  flow  accumulation  generates  several  cells 
wide  stream  tubes  that  may  include  cells  above  the  given 
threshold,  a  new  stream  starting  point  was  defined  only  if 
it  was  not  located  within  such  a  stream  tube  (Fig.  4).  Down- 
stream channel  tracing  followed  the  steepest  slope  and  max- 
imum flow  accumulation  within  the  stream  tube.  To  egress 
fi-om  depressions  using  the  LCP  method,  streams  again  fol- 
lowed the  maximum  flow  accumulation  which  was  in  these 
cases  identical  to  the  drainage  direction  initially  determined 
during  the  LCP  search.  As  a  result,  the  drainage  paths  ex- 
tracted from  MFD  flow  accumulation  were  a  single  grid  cell 
wide  and  were  easily  vectorized  for  further  analysis.  This 
approach  of  extracting  single  cell  wide  dendritic  and  topo- 
logically  correct  drainage  networks  from  flow  accumulation 
obtained  with  any  kind  of  flow  distribution  method  avoids 
the  problems  associated  with  skeletonizing  rasterized  MFD 
channels.  The  MFD  capabilities  to  map  flow  dispersal  in 
floodplains  can  then  be  preserved  and  single  flow  direction 
(SFD)  routing  is  not  necessary  for  flow  accumulation  and 
subsequent  channel  extraction  (although  it  remains  an  op- 
tion). 

Assessment  of  drainage  network  extraction  accuracy 
was  then  performed  by  calculating  distances  between  the 
drainage  paths  derived  from  the  IF  S ARE  and  SRTM  DEMs 
and  the  points  measured  along  the  streams,  consisting  of  338 
on-ground  GPS  measurements  and  995  points  digitized  along 
rivers  from  the  Landsat  imagery  (Figs.  3,4). 

In  order  to  determine  which  method  or  level  of  detail  pro- 
vided more  accurate  results,  we  determined  the  distance  of 
reference  points  to  the  nearest  extracted  drainage  path.  The 


difference  in  the  distance  to  reference  points  between  meth- 
ods or  level  of  detail  was  then  used  to  assess  which  method 
or  level  of  detail  provided  more  accurate  results.  Distance 
differences  were  tested  with  two-tailed  Student's  t-tests  for 
statistical  significance  and  a  =  0.05.  If  distance  differences 
were  significantly  different  from  zero,  i.e.  one  of  the  tested 
methods/levels  of  detail  was  more  accurate  than  the  other,  the 
sign  of  the  difference  indicated  which  method/level  of  detail 
was  more  accurate. 


3  Results 

3.1    Performance  comparison 

To  demonstrate  the  improvement  in  computational  perfor- 
mance of  the  new  LCP  method  implementation,  the  process- 
ing times  needed  by  the  old  and  new  version  of  r.watershed, 
and  r.terraflow  were  compared,  with  the  results  summarized 
in  Table  1.  For  flow  accumulation  computation,  the  new 
version  was  350  times  faster  than  the  old  version  for  the 
central  Panama  area  represented  by  a  relatively  small  DEM 
with  2  million  grid  cells  at  90  m  resolution.  The  improve- 
ment was  even  more  dramatic  for  the  countrywide  coverage 
at  90  m  resolution  with  27  million  grid  cells:  the  new  version 
was  about  1750  times  faster.  Although  absolute  processing 
times  are  dependent  on  the  specific  hardware  used  for  test- 
ing, we  expect  that  relative  time  differences  will  be  simi- 
lar on  other  systems  because  the  software  optimizations  are 
hardware-independent.  Processing  the  larger  regions  used 
in  this  study  with  the  old  version  was  impractical  because 
it  could  easily  take  days  (Arge  et  al.,  2003),  whereas  the  new 
version  required  only  minutes  to  execute.  The  computational 
time  needed  by  the  I/O-eflficient  r.terraflow  was  significantly 
lower  than  that  for  the  old  rwatershed,  but  the  r.terraflow 
module  was  not  as  fast  as  the  new  LCP  implementation  on 
our  test  system  (Table  1).  We  have  also  considered  the  case 
when  the  data  do  not  fit  into  memory  and  rwatershed  uses 
segmented  processing,  with  intermediate  data  being  stored 
on  disk.  This  leads  to  longer  processing  times  than  for  the 
all-in-memory  mode.  Processing  times  observed  on  our  test- 
ing system  in  the  segmented  mode  were  still  shorter  than  for 
r.terraflow,  which  uses  I/O-eflficient  algorithms  specifically 
designed  for  large  datasets  (Arge  et  al.,  2003).  The  size  of 
intermediate  data  created  by  rwatershed  in  the  segmented 
mode  was  less  than  20%  of  those  created  by  r.terraflow.  This 
can  be  of  advantage  if  intermediate  data  are  just  a  bit  too  large 
to  fit  into  memory.  It  is  important  to  note  that,  theoretically, 
r.terraflow  with  its  advanced  I/O-eflficient  algorithms  should 
be  faster  than  rwatershed  for  DEMs  much  larger  than  the 
ones  processed  here  and  on  systems  with  less  physical  mem- 
ory. 

TAS  GIS  was  not  included  in  the  performance  compar- 
isons, because  its  memory  requirements  for  processing  the 
10  m  IFSARE  DEM  and  the  30  m  SRTM  DEM  for  all  of 
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Table  1.  Processing  time  for  the  different  DEMs  on  a  Linux  64  bit  system  with  an  AMD  Athlon  X2  3  GHz  CPU  and  8  GB  RAM. 


SRTM  90  m 
Central  Panama 
1.9  million  cells 

IFSARESOm 
central  Panama 
17  million  cells 

SRTM  90  m 
all  of  Panama 
27  million  cells 

IPS  ARE  10  m 
central  Panama 
156  million  cells 

SRTM  30  m 
all  of  Panama 
241  million  cells 

old  r.watershed, 
all  in  memory 

23.2min 

23h  lOmin 

30h46  min 

>  2  days 
not  measured 

>  2  days 
not  measured 

new  r.watershed, 
all  in  memory 

0.07  min 

0.83  min 

1.05  min 

9.97  min 

12.3  min 

new  r.watershed, 
data  on  disk 

0.15min 

1.73  min 

2.05  min 

30.7  min 

32. 18  min 

r.terraflow 

0.45  min 

4.4  min 

5.05  min 

66.95  min 

58.7  min 

Table  2.  Median  distances  of  reference  points  to  nearest  extracted 
stream  in  meters. 


GPS 

Landsat 

filling 

IRA 

LCP 

filling 

IRA 

LCP 

IFSARE  10  m 

62.65 

N/A* 

34.40 

43.75 

N/A* 

39.25 

IFSARE  30  m 

77.15 

71.42 

70.71 

48.88 

48.27 

47.75 

SRTM  30  m 

99.78 

92.46 

89.13 

84.25 

82.81 

83.72 

SRTM  90  m 

124.84 

119.69 

121.00 

94.30 

94.69 

94.92 

*  Computation  cancelled  by  software  with  out  of  memory  error. 
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Fig.  5.  Mean  and  standard  deviation  of  the  distance  differences  be- 
tween sink  filling  and  least-cost  path  search.  A  positive  distance  dif- 
ference indicates  that  streams  extracted  with  least-cost  path  search 
are  closer  to  reference  points  than  streams  extracted  with  sink  fill- 
ing. 


Fig.  6.  Mean  and  standard  deviation  of  the  distance  differences 
between  impact  reduction  approach  and  least-cost  path.  A  posi- 
tive distance  difference  indicates  that  streams  extracted  with  least- 
cost  path  search  are  closer  to  reference  points  than  streams  extracted 
with  the  impact  reduction  approach. 


lize,  hence  the  out-of-memory  error  was  regarded  as  a  lim- 
itation of  this  application  and  not  as  a  hardware  limitation. 
Therefore,  TAS  was  used  only  for  the  sink  treatment  meth- 
ods in  comparison  with  the  30  m  and  90  m  DEM  covering 
the  central  Panama  subregion  (Fig.  2)  and  results  of  the  IRA 
method  were  not  available  for  10  m  IFSARE.  At  the  time  of 
writing,  IRA  was  not  included  in  WhiteboxGAT  (Lindsay, 
2009)  that  has  replaced  TAS. 

3.2    Comparison  between  different  methods  of 
sink  treatment 


Panama  exceeded  available  system  memory.  TAS  GIS  is  a 
32  bit  application  for  Microsoft  Windows  and  can  only  uti- 
lize as  much  memory  as  a  32  bit  application  can  manage, 
i.e.  3  GB.  As  opposed  to  r.terraflow  and  r.watershed,  TAS 
GIS  keeps  all  intermediate  data  in  memory,  thus  limiting  the 
size  of  the  DEMs  that  can  be  processed.  The  test  system  pro- 
vided as  much  memory  as  TAS  GIS  could  theoretically  uti- 


To  evaluate  the  impact  of  different  approaches  in  the  han- 
dling of  elevation  surface  depressions,  median  distances  of 
reference  points  to  extracted  drainage  paths  for  each  sink 
treatment  method  and  each  processed  DEM  were  computed 
and  the  results  are  summarized  in  Table  2.  As  expected,  the 
location  of  extracted  drainage  paths  improved  with  increas- 
ing detail  available  in  the  DEM.  While  the  drainage  paths  for 
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Fig.  7.  Mean  and  standard  deviation  of  the  distance  differences 
between  the  30  m  SRTM  DEM  and  the  30  m  IFSARE  DEM.  A  pos- 
itive distance  difference  indicates  that  streams  extracted  from  the 
30  m  IFSARE  DEM  are  closer  to  reference  points  than  streams  ex- 
tracted from  the  30  m  SRTM  DEM.  LCP:  least-cost  path  search; 
IRA:  impact  reduction  approach. 
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Fig.  8.  Mean  and  standard  deviation  of  the  distance  differences  be- 
tween the  90  m  and  the  30  m  SRTM  DEM.  A  positive  distance  dif- 
ference indicates  that  streams  extracted  from  the  30  m  SRTM  DEM 
are  closer  to  reference  points  than  streams  extracted  from  the  90  m 
SRTM  DEM.  LCP:  least-cost  path  search;  IRA:  impact  reduction 
approach. 


the  90  m  SRTM  DEM  were  about  100  m  away  from  refer- 
ence points,  the  median  distances  improved  down  to  34.4  m 
for  the  IFSARE  10  m  DEM. 

Drainage  paths  extracted  from  the  LCP  flow  accumulation 
were  closer  to  the  GPS  field  points  (t-tests,  all  ^337  <  —5.4,  all 
p  <  0.0001)  for  all  DEMs  at  all  tested  resolutions  (IFSARE 
10  m,  30  m,  and  SRTM  30  m,  90  m)  when  compared  with  the 
sink  filling  method  (Fig.  5  showing  bar  graphs  with  mean  and 
standard  deviation  of  the  distance  differences).  The  drainage 
paths  extracted  from  LCP  were  also  closer  to  the  points  dig- 
itized from  Landsat  for  all  DEMs  (t-tests,  all  ^994  <  —2.3, 
all  p  <  0.0001),  except  for  the  90  m  SRTM  where  the  dis- 
tance difference  was  not  significant  for  (t-test,  ^994  =  —  1 .213, 
/7  =  0.23). 

When  compared  to  the  IRA  method  (Fig.  6  showing  bar 
graphs  with  mean  and  standard  deviation  of  the  distance  dif- 
ferences), drainage  paths  extracted  from  LCP  flow  accumula- 
tion were  closer  to  the  GPS  field  points  for  the  30  m  IFSARE 


(t-test,  ^337  =  -2.361,  p  =  0.018)  and  30m  SRTM  (t-test, 
^337  =  -3.088,  p  =  0.002),  but  for  the  90m  SRTM  the  dif- 
ference was  not  significant  (t-test,  ^337  =  —1.745,  p  =  0.08). 
For  the  points  digitized  from  Landsat,  drainage  paths  ex- 
tracted from  LCP  accumulation  were  not  closer  than  IRA 
drainage  paths  for  any  of  the  tested  DEMs  (t-tests,  all  ab- 
solute ^994  <  1.0,  all  p  >  0.3). 

3.3  Comparison  between  different  levels  of  detail 
at  constant  resolution 

The  10  m  IFSARE  and  90  m  SRTM  were  interpolated  to  the 
same  resolution  of  30  m.  We  investigated  how  much  the  dif- 
ferent levels  of  detail  at  identical  resolution  influenced  the  ac- 
curacy of  extracted  drainage  paths  for  all  three  sink  treatment 
methods  (Fig.  7  showing  bar  graphs  with  mean  and  standard 
deviation  of  the  distance  differences).  The  distance  between 
the  reference  points  and  drainage  paths  extracted  from  the 
30  m  IFSARE  DEM  was  subtracted  from  the  distance  be- 
tween the  reference  points  and  drainage  paths  extracted  from 
the  30  m  SRTM  DEM  to  measure  the  difference  in  accuracy 
of  drainage  paths  extracted  from  these  two  DEMs.  For  larger 
rivers  in  flat  areas  (Landsat  points),  the  accuracy  of  drainage 
path  locations  was  considerably  higher  for  the  30  m  IFSARE 
DEM  than  for  the  30  m  SRTM  DEM  (t-test,  all  ^994  <  - 13.0, 
all  p  <  0.001;  on  average  35  m  closer)  for  all  methods.  For 
smaller  rivers  not  broader  than  30  m  in  mountainous  regions 
(GPS  points),  only  IRA  showed  a  significant  improvement 
in  accuracy  from  the  30  m  SRTM  DEM  to  the  30  m  IFSARE 
DEM  (t-test,  ^337  =  -3.315,  j9  =  0.001,  all  other  ^337  >  -1.9 
and  <0,  p  >  0.05).  However,  as  shown  in  the  previous  sec- 
tion for  the  30  m  IFSARE  DEM,  IRA  drainage  path  locations 
in  mountainous  regions  were  significantly  less  accurate  than 
LCP  drainage  path  locations. 

3.4  Effects  of  RST  resampling  on  the  accuracy  of 
drainage  network  extraction 

In  order  to  assess  the  effect  of  SRTM  resampling  from  90  m 
to  30  m  resolution  by  the  RST  method,  the  drainage  paths 
extracted  from  the  90  m  SRTM  DEM  were  compared  with 
drainage  paths  extracted  from  the  resampled  30  m  SRTM 
DEM  by  calculating  the  difference  in  drainage  path  distances 
to  reference  points  (Fig.  8  showing  bar  graphs  with  mean  and 
standard  deviation  of  the  distance  differences).  Correspond- 
ing median  distances  of  reference  points  to  drainage  path  lo- 
cations are  given  in  Table  2.  The  accuracy  of  drainage  path 
locations  was  improved  by  resampling  for  larger  rivers  in  flat 
landscape  for  all  sink  treatment  methods  (t-tests,  all  ^994  < 
—6.5,  all  /?<  0.001),  however,  for  GPS  points  in  mountain- 
ous regions  only  the  results  obtained  by  the  least-cost  path 
search  were  improved  (t-test,  ^337  =  —3.734,  p  <  0.001,  all 
other  ^337  >  —  1  and  <0,  p  >  0.3). 
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Fig.  9.  A  typical  example  for  the  impact  of  sink  filling  along  a  section  of  a  river  course  extracted  from  the  30  m  SRTM  DEM.  Along  this 
section,  92%  of  all  elevation  values  were  modified.  Parts  of  the  river  course  obscured  by  trees  are  highlighted  in  the  GoogleEarth  sreenshot. 


Fig.  10.  Typical  effects  of  different  sink  treatment  methods  on  extracted  stream  courses  in  (A)  mountainous  areas  and  (B)  costal  plains.  Filled 
sinks  are  grayed.  Streams  extracted  from  sink-filling  in  red  are  overlaid  by  streams  extracted  from  least-cost  path  search  in  blue.  Points  of 
confluences  (yellow  cross)  in  mountainous  areas  can  be  shifted  considerably,  and  in  coastal  plains,  sink  filling  can  lead  to  long  straight  lines. 
A  shaded  relief  of  the  10  m  IFSARE  DEM  was  used  as  background. 


4  Discussion 

Digital  Terrain  Models,  as  an  approximation  of  real  terrain  at 
a  given  level  of  detail,  include  random  noise  and  errors.  For 
example,  an  in  depth  analysis  of  SRTM  accuracy  and  source 
of  errors  is  provided  by  Rodriguez  et  al.  (2006)  and  Farr  et 
al.  (2007).  The  errors  can  have  significant  impact  on  the  re- 
sults of  DEM  analysis  and  their  treatment  has  been  the  focus 
of  broad  research  (e.g.  Jenson  and  Domingue,  1988;  Lindsay 
and  Creed,  2005;  Hutchinson,  2000).  Sink  (depression)  re- 
moval was  designed  to  remove  small  depressions  introduced 
as  artifacts  of  elevation  data  processing  for  the  first  genera- 
tion of  DEMs  to  facilitate  continuous  flow  routing  (Jenson 
and  Domingue,  1988).  This  procedure,  referred  to  as  hydro- 
logical  conditioning,  has  become  widespread  even  for  next 
generations  of  elevation  data  and  is  now  a  pre-processing 


step  required  by  a  number  of  applications  for  hydrological 
analysis  and  modeling.  In  order  to  alleviate  the  problem  of 
routing  surface  flow  through  the  flat  areas  created  by  sink 
filling,  efforts  have  been  made  to  add  a  gradient  to  these  flat 
areas  (e.g.  Martz  and  Garbrecht,  1998;  Wang  and  Liu,  2006; 
Grimaldi  et  al.,  2007;  Santini  et  al.,  2009)  Nevertheless,  as 
modern  high  resolution  data  such  as  LiDAR  demonstrate, 
actual  terrain  includes  many  true  depressions,  especially  in 
riffle-and-pool  portions  of  mountainous  rivers,  and  on  flood- 
plains  and  coastal  plains  (Notebaert  et  al.,  2009).  There- 
fore, sink  filling  is  not  always  an  appropriate  approach  for 
hydrological  conditioning  of  a  DEM  because  the  measured 
elevation  data  are  replaced  with  new  elevation  values  based 
on  the  rather  unrealistic  assumption  that  the  terrain  has  no 
depressions  (Jenson  and  Domingue,  1988;  Tarboton,  1997), 
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be  they  artificial  or  real.  Although  many  hydrologic  mod- 
els require  depression-less  DEMs,  removal  of  sinks  can  alter 
the  true  hydrological  state  that  includes  standing  water  in  de- 
pressions. Moreover,  in  IFSARE  and  SRTM  DEMs  that  in- 
clude vegetation  cover,  many  of  the  observed  depressions  are 
caused  by  gaps  in  canopy,  canopy  closing  over  river  courses, 
or  by  vegetation  partially  obstructing  narrow  and  deep  val- 
leys. The  filling  of  such  features  requires  altering  elevations 
by  several  meters  and  over  areas  of  hundreds  of  m^,  leading 
to  large  flats  and  an  artificial  drainage  network  geometry  that 
does  not  represent  reality.  Figure  9  presents  an  example  of 
the  barriers  across  the  river  channel  created  by  overhanging 
trees  and  a  profile  that  illustrates  the  extensive  modification 
in  elevation  that  can  result  from  such  filling.  Preserving  most 
of  the  drainage  directions  of  the  original  DEM  is  preferable 
to  hydrological  conditioning  in  this  case. 

Our  results  suggest  that  the  adverse  effects  of  sink  filling 
become  more  pronounced  for  higher  resolution  DEMs  (see 
Fig.  10  for  drainage  paths  extracted  from  10  m  IFSARE). 
By  contrast,  at  lower  resolutions  such  as  90  m,  the  coarsest 
resolution  tested  in  this  study,  there  was  often  no  significant 
difference  in  drainage  path  locations  between  the  compared 
methods,  particularly  for  Landsat  reference  points.  Land- 
sat  points  where  predominantly  located  in  flood  plain  and 
coastal  plain  landscapes  where  both  the  horizontal  and  the 
vertical  resolution  of  90  m  SRTM  were  often  not  sufiicient  to 
locate  rivers  and  no  method  was  able  to  accurately  delineate 
drainage  paths  in  these  areas  from  90  m  SRTM  (Hancock  et 
al.,  2006;  Valeriano  et  al,  2006). 

When  comparing  SRTM  and  IFSARE  DEMs  at  the  same 
resolution  of  30  m,  the  drainage  networks  extracted  from 
the  IFSARE  DEM  were,  as  expected,  more  accurate  than 
drainage  networks  extracted  from  SRTM  DEM,  particularly 
for  Landsat  points  in  relatively  flat  areas.  As  shown  in  previ- 
ous studies  (Hancock  et  al.,  2006;  Valeriano  et  al.,  2006), 
the  SRTM  DEM  close  to  its  native  resolution  of  3  arc  sec 
does  not  provide  sufiicient  vertical  detail  to  determine  river 
courses  in  flat  areas.  The  applied  RST  resampling  method 
significantly  improved  the  accuracy  of  extracted  drainage 
path  locations  for  SRTM  in  regions  with  low  topography. 
Apparently  the  level  of  detail  has  been  not  just  increased  but 
also  improved  because  resampling  to  a  higher  horizontal  (and 
for  SRTM  also  higher  vertical)  resolution  seems  to  be  help- 
ful in  improving  the  results  of  watershed  analysis  from  low- 
resolution  DEMs  (see  also  Valeriano  et  al.,  2006).  The  global 
SRTM  coverage  at  3  arc  sec  horizontal  resolution  has  been 
created  by  averaging  the  original  1  arc  sec  coverage  (Farr  et 
al.,  2007).  Depending  on  the  method  used,  reinterpolating 
the  3  arc  sec  version  back  to  1  arc  sec  can  result  in  narrowed 
valleys  and  ridges  and  increased  channel  sinuosity.  Interest- 
ingly, only  the  LCP  search  method  benefited  from  the  reinter- 
polation  in  mountainous  regions  and  provided  here  the  most 
accurate  drainage  path  locations  for  the  30  m  SRTM  DEM. 

It  is  important  to  note  that  the  differences  in  the  tested 
methods  were  significant  due  to  the  particularly  challenging 


study  region.  Smaller  differences  between  the  methods  could 
be  expected  in  topography  with  gentle  slopes  and  uniformly 
low  vegetation  cover.  Although  network  connectivity,  topol- 
ogy and  channel  morphometry  has  not  been  investigated,  the 
visual  inspection  of  the  results  indicates  that  there  were  dif- 
ferences between  the  results  obtained  by  different  methods 
and  resolutions  in  the  coastal  plain  areas.  None  of  the  tested 
DEMs  or  methods  produced  sufficiently  reliable  results  and 
higher  resolution  did  not  always  lead  to  correct  topology. 
These  results  are  similar  to  those  found  for  LiDAR-based 
DEMs  (Colson,  2006)  and  require  further  research. 

Although  the  tests  presented  here  demonstrated  the  per- 
formance of  the  LCP  method  for  radar-based  data,  the  new, 
improved  version  of  r.watershed  has  in  fact  been  developed 
using  LiDAR  data  provided  by  the  USGS  in  addition  to  the 
1  m  LiDAR  DEM  examples  and  data  provided  by  Neteler  and 
Mitasova  (2008).  The  presented  method  processes  a  DEM 
without  modifying  its  elevation  values  and  is  thus  to  a  de- 
gree (quantitatively  narrowed  by  this  study)  dependent  on 
the  reliability  of  the  provided  DEM.  The  LCP  search  ap- 
proach, despite  its  capability  to  route  flow  through  depres- 
sions, will  therefore,  like  other  methods,  fail  to  accurately 
extract  drainage  networks  if  the  input  DEM  deviates  too  far 
from  real  topography. 

5  Conclusions 

We  presented  a  method  for  fast  computation  of  flow  accu- 
mulation and  drainage  network  extraction  for  large  DEMs 
with  nested  depressions  and  evaluated  it  against  other  com- 
monly used  methods  for  sink  treatment.  The  accuracy  assess- 
ment using  ground  control  points  and  Landsat  satellite  im- 
agery provided  insight  into  the  accuracy  of  drainage  network 
extraction  from  radar  elevation  data  in  a  challenging  tropi- 
cal forest  environment  and  coastal  plain  setting.  The  results 
suggest  that  the  conceptually  simple  sink  filling  approach  is 
not  suitable  for  the  IFSARE  and  SRTM  DEMs  in  regions 
with  significant  vegetation  cover  whereas  both  the  tested  im- 
pact reduction  approach  of  Lindsay  and  Creed  (2005)  and  the 
LCP  search  provide  more  accurate  drainage  networks. 

The  performance  testing  has  demonstrated  that  the  new 
implementation  dramatically  improves  computational  efii- 
ciency  while  preserving  the  high  accuracy  of  the  LCP  rout- 
ing capabilities.  The  increase  in  computational  performance 
is  particularly  relevant  for  the  modem  mapping  technologies 
that  can  rapidly  produce  massive  DEMs  at  high  resolutions 
for  large  areas  and  the  slow  processing  and  analysis  has  be- 
come bottleneck  in  their  application.  Fast  processing  of  mas- 
sive DEMs  opens  their  application  to  a  new  level  of  detail  and 
spatial  extent,  for  example  in  rapid  response  operations  or  for 
mapping  in  remote  regions  where  the  streams  are  covered  by 
dense  vegetation  and  no  reliable  stream  data  are  available. 
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Abstract — We  address  the  percolation-based  connectivity  of 
large-scale  ad  hoc  heterogeneous  wireless  networks,  where  sec- 
ondary users  exploit  channels  temporarily  unused  by  primary 
users  and  the  existence  of  a  communication  link  between  two  sec- 
ondary users  depends  on  not  only  the  distance  between  them  but 
also  the  transmitting  and  receiving  activities  of  nearby  primary 
users.  We  introduce  the  concept  of  connectivity  region  defined  as 
the  set  of  density  pairs — the  density  of  secondary  users  and  the 
density  of  primary  transmitters  —  under  which  the  secondary 
network  is  connected.  Using  theories  and  techniques  from  con- 
tinuum percolation,  we  analytically  characterize  the  connectivity 
region  of  the  secondary  network  and  reveal  the  tradeoff  be- 
tween proximity  (the  number  of  neighbors)  and  the  occurrence 
of  spectrum  opportunities.  Specifically,  we  establish  three  basic 
properties  of  the  connectivity  region — contiguity,  monotonicity  of 
the  boundary  and  uniqueness  of  the  infinite  connected  component, 
where  the  uniqueness  implies  the  occurrence  of  a  phase  transition 
phenomenon  in  terms  of  the  almost  sure  existence  of  either  zero 
or  one  infinite  connected  component;  we  identify  and  analyze 
two  critical  densities  which  jointly  specify  the  profile  as  well  as 
an  outer  bound  on  the  connectivity  region;  we  study  the  im- 
pacts of  secondary  users'  transmission  power  on  the  connectivity 
region  and  the  conditional  average  degree  of  a  secondary  user 
and  demonstrate  that  matching  the  interference  ranges  of  the 
primary  and  the  secondary  networks  maximizes  the  tolerance  of 
the  secondary  network  to  the  primary  traffic  load.  Furthermore, 
we  establish  a  necessary  condition  and  a  sufficient  condition  for 
connectivity,  which  lead  to  an  outer  bound  and  an  inner  bound  on 
the  connectivity  region. 

Index  Terms — Cognitive  radio,  connectivity  region,  continuum 
percolation,  critical  densities,  heterogeneous  wireless  network, 
phase  transition. 


1.  Introduction 

THE  communication  infrastructure  is  becoming  increas- 
ingly heterogeneous,  with  a  dynamic  composition  of  in- 
terdependent, interactive  and  hierarchical  network  components 
with  different  priorities  and  service  requirements.  One  example 
is  the  cognitive  radio  technology  [1]  for  opportunistic  spectrum 
access  which  adopts  a  hierarchical  structure  for  resource  sharing 
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[2].  Specifically,  a  secondary  network  is  overlaid  with  a  pri- 
mary network,  where  secondary  users  identify  and  exploit  tem- 
porarily and  locally  unused  channels  without  causing  unaccept- 
able interference  to  primary  users  [2]. 

A.  Connectivity  and  Connectivity  Region 

While  the  connectivity  of  homogeneous  ad  hoc  networks  con- 
sisting of  peer  users  has  been  well  studied  (see,  for  example, 
[3]-[10]),  little  is  known  about  the  connectivity  of  heteroge- 
neous networks.  The  problem  is  fundamentally  different  from 
its  counterpart  in  homogeneous  networks.  In  particular,  the  con- 
nectivity of  the  low-priority  network  component  depends  on  the 
characteristics  (traffic  pattern/load,  topology,  interference  toler- 
ance, etc.)  of  the  high-priority  component,  thus  creating  a  much 
more  diverse  and  complex  design  space. 

Due  to  the  hierarchical  structure  of  spectrum  sharing,  a  com- 
munication link  exists  between  two  secondary  users  if  the  fol- 
lowing two  conditions  hold:  (CI)  they  are  within  each  other's 
transmission  range;  (C2)  they  see  a  spectrum  opportunity  de- 
termined by  the  transmitting  and  receiving  activities  of  nearby 
primary  users  (see  Section  II-B-I).  We  say  that  a  topological  link 
exists  between  two  secondary  users  if  only  condition  (CI)  holds. 

Using  theories  and  techniques  from  continuum  percolation, 
we  analytically  characterize  the  percolation-based  connectivity 
of  the  secondary  network  in  a  large-scale  ad  hoc  heterogeneous 
network.  Specifically,  we  consider  a  Poisson  distributed  sec- 
ondary network  overlaid  with  a  Poisson  distributed  primary 
network  in  an  infinite  two-dimensional  Euclidean  space^.  We 
define  network  connectivity  as  the  existence  of  an  infinite 
connected  component  formed  by  communication  links  almost 
surely  (a.s.),  i.e.,  the  occurrence  of  percolation.  This  connec- 
tivity characterizes  the  accessibility  of  two  secondary  users  at 
a  given  instant  and  is  thus  also  called  instantaneous  mutual 
connectivity.  Given  the  transmission  power  and  the  interfer- 
ence tolerance  of  both  the  primary  and  the  secondary  users,  the 
connectivity  of  the  secondary  network  depends  on  the  density 
of  secondary  users  [due  to  (CI)]  and  the  traffic  load  of  primary 
users  [due  to  (C2)] . 

We  thus  introduce  the  concept  of  connectivity  region  C,  de- 
fined as  the  set  of  density  pairs  (Xsj^pt)  under  which  the  sec- 
ondary network  is  connected,  where  Xs  denotes  the  density  of 
the  secondary  users  and  A pt  the  density  of  primary  transmitters 
(representing  the  traffic  load  of  the  primary  users).  As  illustrated 
in  Fig.  1,  a  secondary  network  with  a  density  pair  (A 5,  Apt)  in- 
side the  shaded  region  is  connected:  the  secondary  network  has 

^This  infinite  network  model  is  equivalent  in  distribution  to  the  limit  of  a 
sequence  of  finite  networks  with  a  fixed  density  as  the  area  of  the  network  in- 
creases to  infinity,  i.e.,  the  so-called  extended  network  [1 1].  It  follows  from  the 
arguments  similar  to  the  ones  used  in  [12,  Chapter  3]  for  homogeneous  ad  hoc 
networks  that  this  infinite  ad  hoc  heterogeneous  network  model  represents  the 
limiting  behavior  of  large-scale  networks. 
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Fig.  1.  Connectivity  region  C  (the  upper  boundary  Ap^(A5)  is  defined  as  the 
supremum  density  of  the  primary  transmitters  that  ensures  connectivity  with  a 
fixed  density  of  the  secondary  users;  the  critical  density  A  J  of  the  secondary 
users  is  defined  as  the  infimum  density  of  the  secondary  users  that  ensures  con- 
nectivity under  a  positive  density  of  the  primary  transmitters;  the  critical  density 
Ap^  of  the  primary  transmitters  is  the  supremum  density  of  the  primary  trans- 
mitters that  ensures  connectivity  with  sl  finite  density  of  the  secondary  users). 

a  giant  connected  component  which  includes  infinite  number  of 
secondary  users.  The  existence  of  the  giant  connected  compo- 
nent enables  bidirectional  communications  between  distant  sec- 
ondary users  via  multihop  relaying.  On  the  other  hand,  a  sec- 
ondary network  with  a  density  pair  (A 5,  Apt)  outside  this  re- 
gion is  not  connected:  the  network  is  separated  into  an  infinite 
number  of  finite  connected  components.  Consequently,  any  sec- 
ondary user  can  only  communicate  with  users  within  a  limited 
range. 

It  is  shown  in  [13]  and  [14]  that  when  the  primary  network  is 
static,  this  instantaneous  connectivity  is  equivalent  to  the  con- 
nectivity defined  in  terms  of  the  finiteness  of  the  minimum  mul- 
tihop delay  (referred  to  as  fd-connectivity);  when  the  primary 
traffic  has  temporal  dynamics  (no  matter  how  small  the  range 
of  the  dynamics  is),  this  instantaneous  connectivity  is  stronger 
than  the  fd-connectivity  and  the  secondary  network  can  be  in- 
termittently connected  as  long  as  it  is  topologically  connected 
[13],  [14]. 

The  objective  of  this  paper  is  to  establish  analytical  charac- 
terizations of  the  connectivity  region  and  to  study  the  impact  of 
system  design  parameters  (in  particular,  the  transmission  power 
of  the  secondary  users)  on  network  connectivity.  The  main  re- 
sults are  summarized  in  the  subsequent  two  subsections. 

B.  Analytical  Characterizations  of  the  Connectivity  Region 

We  first  establish  three  basic  properties  of  the  connectivity  re- 
gion: contiguity,  monotonicity  of  the  boundary  and  uniqueness 
of  the  infinite  connected  component.  Specifically,  based  on  a 
coupling  argument,  we  show  that  the  connectivity  region  is  a 
contiguous  area  bounded  below  by  the  A^-axis  and  bounded 
above  by  a  monotonically  increasing  function  Ap^(A5)  (see 
Fig.  1),  where  the  upper  boundary  Ap^(A5)  is  defined  as 

Xpt{^s)  =  supjApT  :  5(A5,Apt)  is  connected}  (1) 

with  G{Xsy  Apt)  denoting  the  secondary  network  of  density  A5 
overlaid  with  a  primary  network  specified  by  the  density  Apt 
of  the  primary  transmitters.  The  uniqueness  of  the  infinite  con- 
nected component  is  established  based  on  ergodic  theory  and 
certain  combinatorial  results.  It  shows  that  once  the  secondary 


network  is  connected,  it  contains  a  unique  infinite  connected 
component  a.s. 

Second,  we  identify  and  analyze  two  critical  parameters  of 
the  connectivity  region:  A^  and  Ap^.  They  jointly  specify  the 
profile  as  well  as  an  outer  bound  on  the  connectivity  region.  Re- 
ferred to  as  the  critical  density  of  the  secondary  users,  A^  is  the 
infimum  density  of  the  secondary  users  that  ensures  connectivity 
under  a  positive  density  of  the  primary  transmitters 

Xg  =  inf{A5  :  3Apt  >  0  s.t.  Q{Xs^  Apt)  is  connected}. 

(2) 

We  show  that  A  J  equals  the  critical  density  Ac  of  a  homogeneous 
ad  hoc  network  (i.e.,  in  the  absence  of  primary  users),  which 
has  been  well  studied  [15].  This  result  shows  that  the  "takeoff 
point  in  the  connectivity  region  is  completely  determined  by  the 
effect  of  proximity — the  number  of  neighbors  (nodes  within  the 
transmission  range  of  a  secondary  user). 

Referred  to  as  the  critical  density  of  the  primary  transmitters, 
Ap^  is  the  supremum  density  of  the  primary  transmitters  that 
ensures  the  connectivity  of  the  secondary  network  with  a  finite 
density  of  the  secondary  users: 

App  =  supjApT  :  3A5  <  00  s.t.  Q{Xs^Xpt)  is  connected}. 

  (3) 

We  obtain  an  upper  bound  on  App,  which  is  shown  to  be  achiev- 
able in  simulations.  More  importantly,  this  result  shows  that 
when  the  density  of  the  primary  transmitters  is  higher  than  the 
(finite)  value  given  by  this  upper  bound,  the  secondary  network 
cannot  be  connected  no  matter  how  dense  it  is.  This  parameter 
App  thus  characterizes  the  impact  of  opportunity  occurrence 
on  the  connectivity  of  the  secondary  network:  when  the  den- 
sity of  the  primary  transmitters  is  beyond  a  certain  level,  there 
are  simply  not  enough  spectrum  opportunities  for  any  secondary 
network  to  be  connected. 

Since  a  precise  characterization  of  the  upper  boundary 
Apt(A5)  of  the  connectivity  region  is  intractable,  we  establish 
a  necessary  and  a  sufficient  condition  for  connectivity  to  pro- 
vide an  outer  and  an  inner  bound  on  the  connectivity  region. 
The  necessary  condition  is  expressed  in  the  form  of  the  condi- 
tional average  degree  of  a  secondary  user  and  is  derived  by  the 
construction  of  a  branching  process.  The  sufficient  condition 
is  obtained  by  the  discretization  of  the  continuum  percolation 
model  into  a  dependent  site  percolation  model. 

C.  Impact  of  Transmission  Power  on  Connectivity:  Proximity 
Versus  Opportunity 

The  study  of  the  impact  of  the  secondary  users'  transmis- 
sion power  ptx  on  network  connectivity  reveals  an  interesting 
tradeoff  between  proximity  and  opportunity  in  the  design  of 
heterogeneous  networks.  As  illustrated  in  Fig.  2,  we  show 
that  increasing  ptx  enlarges  the  connectivity  region  C  along 
the  A^-axis  (i.e.,  better  proximity  leads  to  a  smaller  "takeoff 
point),  but  at  the  price  of  reducing  C  along  the  ApT-axis. 
Specifically,  with  a  large  ptx,  few  secondary  users  experience 
spectrum  opportunities  due  to  their  large  interference  footprint 
with  respect  to  the  primary  users.  This  leads  to  a  poor  tolerance 
to  the  primary  traffic  load  parameterized  by  Apt- 

The  transmission  power  ptx  of  the  secondary  network  should 
thus  be  chosen  according  to  the  operating  point  of  the  hetero- 
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Fig.  2.  Simulated  connectivity  regions  for  two  different  transmission  powers 
(ptx  denotes  the  transmission  power  of  the  secondary  users  and  the  large  ptx  is 
3°"  times  the  small  ptx,  where  a  is  the  path-loss  exponent). 

geneous  network  given  by  the  density  of  the  secondary  users 
and  the  traffic  load  of  the  co-existing  primary  users.  Using  the 
tolerance  to  the  primary  traffic  load  as  the  performance  mea- 
sure, we  show  that  the  interference  range  rj  of  the  secondary 
users  should  be  equal  to  the  interference  range  Rj  of  the  pri- 
mary users  in  order  to  maximize  the  upper  bound  on  the  critical 
density  Ap^  of  the  primary  transmitters.  Given  the  interference 
tolerance  of  the  primary  and  secondary  users,  we  can  then  de- 
sign the  optimal  transmission  power  pt^  of  the  secondary  users 
based  on  that  of  the  primary  users. 

D.  Related  Work 

To  our  best  knowledge,  the  connectivity  of  large-scale  ad  hoc 
heterogeneous  networks  has  not  been  characterized  analytically 
or  experimentally  in  the  literature.  There  are  a  number  of  classic 
results  on  the  connectivity  of  homogeneous  ad  hoc  networks. 
For  example,  it  has  been  shown  that  to  ensure  either  1 -connec- 
tivity (there  exists  a  path  between  any  pair  of  nodes)  [3],  [4] 
or  /;:-connectivity  (there  exist  at  least  k  node-disjoint  paths  be- 
tween any  pair  of  nodes)  [5],  the  average  number  of  neighbors 
of  each  node  must  increase  with  the  network  size.  On  the  other 
hand,  to  maintain  a  weaker  connectivity — p-connectivity  (i.e., 
the  probability  that  any  pair  of  nodes  is  connected  is  at  least  p), 
the  average  number  of  neighbors  is  only  required  to  be  above  a 
certain  "magic  number'  which  does  not  depend  on  the  network 
size  [6]. 

The  theory  of  continuum  percolation  has  been  used  by 
Dousse  et  al  in  analyzing  the  connectivity  of  a  homogeneous 
ad  hoc  network  under  the  worst-case  mutual  interference  [7], 
[8].  In  [9],  [10],  the  connectivity  and  the  transmission  delay  in 
a  homogeneous  ad  hoc  network  with  static  or  dynamic  on-off 
links  are  investigated  from  a  percolation-based  perspective. 

In  [13]  and  [14],  the  impact  of  temporal  dynamics  of  primary 
traffic  on  the  connectivity  and  multihop  delay  of  the  secondary 
network  is  studied.  In  particular,  it  is  shown  that  in  the  presence 
of  primary  traffic  dynamics,  the  secondary  network  is  connected 


(either  instantaneously  or  intermittently)  as  long  as  it  is  topolog- 
ically  connected.  Optimal  power  control  in  heterogeneous  net- 
works has  been  studied  in  [16],  which  focuses  on  a  single  pair  of 
secondary  users  in  a  Poisson  network  of  primary  users.  The  im- 
pacts of  secondary  users'  transmission  power  on  the  occurrence 
of  spectrum  opportunities  and  the  reliability  of  opportunity  de- 
tection are  analytically  characterized. 

E.  Organization  and  Notations 

The  rest  of  this  paper  is  organized  as  follows.  Section  II 
presents  the  Poisson  model  of  the  heterogeneous  network.  In 
particular,  the  conditions  for  the  existence  of  a  communication 
link  in  the  secondary  network  is  specified  based  on  a  rigorous 
definition  of  spectrum  opportunity.  In  Section  III,  we  introduce 
the  concept  of  connectivity  region  and  establish  its  three  basic 
properties.  The  two  critical  densities  are  analyzed,  followed 
by  a  necessary  and  a  sufficient  condition  for  connectivity.  In 
Section  IV,  we  demonstrate  the  tradeoff  between  proximity 
and  opportunity  by  studying  the  impact  of  the  secondary  users' 
transmission  power  on  the  connectivity  region  and  on  the 
conditional  degree  of  a  secondary  user.  Further,  we  obtain  the 
optimal  transmission  power  of  the  secondary  users,  adopting 
the  secondary  network's  tolerance  of  the  primary  traffic  load  as 
the  performance  measure.  Section  V  contains  detailed  proofs 
of  the  main  results  and  Section  VI  concludes  the  paper. 

Throughout  the  paper,  we  use  capital  letters  for  parameters 
of  the  primary  users  and  lowercase  letters  for  those  of  the  sec- 
ondary users. 

II.  Network  Model 

We  consider  a  Poisson  distributed  secondary  network  over- 
laid with  a  Poisson  distributed  primary  network  in  an  infinite 
two-dimensional  Euclidean  space.  The  models  of  the  primary 
and  secondary  networks  are  specified  in  the  following  two  sub- 
sections. 

A.  The  Primary  Network 

The  primary  transmitters  are  distributed  according  to  a  two- 
dimensional  Poisson  point  process  with  density  Xpt-  Each  pri- 
mary transmitter's  receiver  is  uniformly  distributed  within  its 
transmission  range  Rp.  Here  we  have  assumed  that  all  primary 
transmitters  use  the  same  transmission  power  and  the  trans- 
mitted signals  undergo  isotropic  path  loss.  Based  on  the  dis- 
placement theorem  [17,  Chapter  5],  it  is  easy  to  see  that  the  pri- 
mary receivers  form  a  two-dimensional  Poisson  point  process 
with  density  Xpt-  Note  that  the  two  Poisson  processes  formed 
by  the  primary  transmitters  and  receivers  are  correlated. 

B.  The  Secondary  Network 

The  secondary  users  are  distributed  according  to  a  two-di- 
mensional Poisson  point  process  with  density  A5,  independent 
of  the  Poisson  processes  of  the  primary  transmitters  and  re- 
ceivers. The  transmission  range  of  the  secondary  users  is  de- 
noted by  Tp . 

Communication  Links:  In  contrast  to  the  case  in  a  homoge- 
neous network,  the  existence  of  a  communication  link  between 
two  secondary  users  depends  on  not  only  the  distance  between 
them  but  also  the  availability  of  the  communication  channel 
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Fig.  3.  Definition  of  spectrum  opportunity. 

(i.e.,  the  presence  of  a  spectrum  opportunity).  The  latter  is  deter- 
mined by  the  transmitting  and  receiving  activities  in  the  primary 
network  as  described  below. 

As  illustrated  in  Fig.  3,  there  exists  an  opportunity  from  A, 
the  secondary  transmitter,  to  B,  the  secondary  receiver,  if  the 
transmission  from  A  does  not  interfere  with  nearby  primary  re- 
ceivers in  the  solid  circle  and  the  reception  at  B  is  not  affected 
by  nearby  primary  transmitters  in  the  dashed  circle  [18].  Re- 
ferred to  as  the  interference  range  of  the  secondary  users,  the 
radius  r/  of  the  solid  circle  at  A  depends  on  the  transmission 
power  of  A  and  the  interference  tolerance  of  the  primary  re- 
ceivers, whereas  the  radius  Rj  of  the  dashed  circle  (the  interfer- 
ence range  of  the  primary  users^)  depends  on  the  transmission 
power  of  the  primary  users  and  the  interference  tolerance  of  B. 

It  is  clear  from  the  above  discussion  that  spectrum  opportuni- 
ties depend  on  both  transmitting  and  receiving  activities  of  the 
primary  users.  Furthermore,  spectrum  opportunities  are  asym- 
metric. Specifically,  a  channel  that  is  an  opportunity  when  A 
is  the  transmitter  and  B  the  receiver  may  not  be  an  opportu- 
nity when  B  is  the  transmitter  and  A  the  receiver.  In  other 
words,  there  exist  unidirectional  communication  links  in  the 
secondary  network.  Since  unidirectional  links  are  difficult  to  uti- 
lize in  wireless  networks  [19],  we  only  consider  bidirectional 
links  in  the  secondary  network  when  we  define  connectivity.  As 
a  consequence,  when  we  determine  whether  there  exists  a  com- 
munication link  between  two  secondary  users,  we  need  to  check 
the  existence  of  spectrum  opportunities  in  both  directions. 

To  summarize,  under  the  disk  signal  propagation  and  inter- 
ference model,  there  is  a  (bidirectional)  link  between  A  and  B 
if  and  only  if  (CI)  the  distance  between  A  and  B  is  at  most  r^; 
(C2)  there  exists  a  bidirectional  spectrum  opportunity  between 
A  and  i.e.,  there  are  no  primary  transmitters  within  distance 
Rj  of  either  Aor  B  and  no  primary  receivers  within  distance  rj 
of  either  Aov  B. 

Connectivity:  We  interpret  the  connectivity  of  the  secondary 
network  in  the  percolation  sense:  the  secondary  network  is  con- 
nected if  there  exists  an  infinite  connected  component  a.s. 

Based  on  the  above  conditions  (CI,  C2)  for  the  existence  of  a 
communication  link,  we  can  obtain  an  undirected  random  graph 
Q{^s^^pt)  corresponding  to  the  secondary  network,  which 
is  determined  by  three  Poisson  point  processes:  the  secondary 

^Since  the  minimum  transmission  power  for  successful  reception  is,  in  gen- 
eral, higher  than  the  maximum  allowable  interference  power,  it  follows  that  the 
transmission  range  Rp  of  primary  users  is  smaller  than  Rj .  Furthermore,  under 
the  disk  signal  propagation  and  interference  model,  we  have  Rp  =  l3Ri  (0  < 
/3  <  1).  A  similar  relationship  holds  for     and  rj. 


Fig.  4.  Realization  of  the  heterogeneous  network.  The  random  graph 
G{Xs,  ^pt)  consists  of  all  the  secondary  nodes  and  all  the  bidirectional  links 
denoted  by  solid  lines.  The  solid  circles  with  radii  Rj  denote  the  interference 
regions  of  the  primary  transmitters  within  which  secondary  users  cannot 
successfully  receive  and  the  dashed  circles  with  radii  r  j  denote  the  required 
protection  regions  for  the  primary  receivers  within  which  the  secondary  users 
should  refrain  from  transmitting. 

users  with  density  A5,  the  primary  transmitters  with  density 
XpT  and  the  primary  receivers  with  density  Apt  (correlated 
with  the  process  of  the  primary  transmitters)^.  See  Fig.  4  for  an 
illustration  of  ^(A^,  Apt). 

The  question  we  aim  to  answer  in  this  paper  is:  when  is  the 
secondary  network  connected,  i.e.,  when  does  percolation  occur 

in  g{Xs,XpTV 

III.  Analytical  Characterizations  of  the 
Connectivity  Region 

Given  the  transmission  power  and  the  interference  tolerance 
of  both  the  primary  and  the  secondary  users  (i.e.,  Rp,  Rj,  rp 
and  ri  are  fixed),  the  connectivity  of  the  secondary  network  is 
determined  by  the  density  A5  of  the  secondary  users  and  the 
density  Apt  of  the  primary  transmitters.  We  thus  introduce  the 
concept  of  connectivity  region  C  of  a  secondary  network,  which 
is  defined  as  the  set  of  density  pairs  (A5,  Apt)  under  which  the 
secondary  network  ^(As,  Apt)  is  connected  (see  Fig.  1) 

C  =  {(A5,  Apt)  :  Q{Xs^Xpt)  is  connected.}. 

A.  Basic  Properties  of  the  Connectivity  Region 

We  next  establish  three  basic  properties  of  the  connectivity 
region. 

Theorem  1:  Basic  Properties  of  the  Connectivity  Region. 
Tl.l  The  connectivity  region  C  is  contiguous,  that  is,  for 
any  two  points  (A51,  Appi),  (A525  ^pt2)  ^  there  exists 
a  continuous  path  in  C  connecting  the  two  points. 
T1.2  The  lower  boundary  of  the  connectivity  region  C  is 
the  A^-axis.  Let  X'prp{\s)  denote  the  upper  boundary  of 
the  connectivity  region  C  defined  by  (1),  then  we  have  that 
Ap^(A5)  is  monotonically  increasing  with  A5. 

^The  two  Poisson  point  processes  of  the  primary  transmitters  and  receivers 
are  essentially  a  snap  shot  of  the  realizations  of  the  primary  transmitters  and 
receivers.  In  different  time  slots,  different  sets  of  primary  users  become  active 
transmitters/receivers.  Thus,  even  if  a  secondary  user  is  isolated  at  one  time  due 
to  the  absence  of  spectrum  opportunities,  it  may  experience  an  opportunity  at  a 
different  time  and  be  connected  to  other  secondary  users. 
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Fig.  5.  Two  realizations  of  the  Poisson  heterogeneous  network  (black  stars  denote  primary  transmitters,  green  plus  signs  denote  primary  receivers,  red  dots  denote 
secondary  users  and  blue  segments  denote  the  bidirectional  links  between  secondary  users).  We  have  removed  secondary  users  who  do  not  see  opportunities  for 
clarity.  The  simulation  parameters  are  given  by  Rp  =  50  m,  Rj  =  80  m,  Xs  =  650  km~^,  =  50  m,  rj  =  80  m  and  the  critical  density  in  this  case  is 
Ac(50)  ^  576  km~^  (a)  percolation  occurs  (Apt  =  10  km~^)  (b)  percolation  does  not  occur  (Apt  =  20  km~^). 


Tl  .3  There  exists  either  zero  or  one  infinite  connected  com- 
ponent in  ^(As,  Apt)  a.s. 

Proof:  The  proofs  of  Tl.l  and  T1.2  are  based  on  the 
coupHng  argument,  a  technique  frequently  used  in  continuum 
percolation  [15,  Section  2.2].  The  proof  of  T1.3  is  based  on 
the  ergodicity  of  the  random  model  driven  by  the  three  Poisson 
point  processes  of  the  primary  transmitters,  the  primary  re- 
ceivers and  the  secondary  users.  The  details  of  the  proofs  are 
given  in  Section  V-B.  ■ 

Tl.l  and  T1.2  specify  the  basic  structure  of  the  connectivity 
region,  as  illustrated  in  Fig.  1.  T1.3  implies  the  occurrence  of  a 
phase  transition  phenomenon,  that  is,  there  exists  either  a  unique 
infinite  connected  component  a.s.  or  no  infinite  connected  com- 
ponent a.s.  This  uniqueness  of  the  infinite  connected  component 
excludes  the  undesirable  possibility  of  having  more  than  one 
(maybe  infinite)  infinite  connected  component  in  the  secondary 
network.  We  point  out  that  such  a  property  is  not  always  present 
in  wireless  networks.  Two  examples  where  more  than  one  infi- 
nite connected  component  exists  in  a  homogeneous  ad  hoc  net- 
work can  be  found  in  [20] . 

B.  Critical  Densities 

In  this  subsection,  we  study  the  critical  density  A^  of  the  sec- 
ondary users  and  the  critical  density  Ap^  of  the  primary  trans- 
mitters defined  by  (2)  and  (3),  respectively.  We  have  the  fol- 
lowing theorem. 

Theorem  2:  Critical  Densities. 

Given  Rp,  Rj,  Vp  and  r/,  we  have: 

T2.1  A^  =  Ac(rp),  where  Xc{rp)  is  the  critical  density  for 
a  homogeneous  ad  hoc  network  with  transmission  range  Tp 
(i.e.,  in  the  absence  of  the  primary  network). 
T2.2 


Ac(l) 


4max{E|,r|}-r2 


(4) 


where  the  constant  Ac(l)  is  the  critical  density  for  a  homo- 
geneous ad  hoc  network  with  a  unit  transmission  range. 


Proof:  The  basic  idea  of  the  proof  of  T2. 1  is  to  approximate 
the  secondary  network  ^(A^,  Apt)  by  a  discrete  edge-percola- 
tion model  on  the  grid.  This  discretization  technique  is  often 
used  to  convert  a  continuum  percolation  model  to  a  discrete 
site/edge  percolation  model  (see,  for  example,  [15,  Chapter  3], 
[8]).  The  details  of  the  proof  are  given  in  Section  V-C-I. 

The  proof  of  T2.2  is  based  on  the  argument  that  if  there  is  an 
infinite  connected  component  in  the  secondary  network,  then  an 
infinite  vacant  component  must  exist  in  the  two  Poisson  Boolean 
models  driven  by  the  primary  transmitters  and  the  primary  re- 
ceivers, respectively.  The  key  point  is  to  carefully  choose  the 
radii  of  the  two  Poisson  Boolean  models  in  order  to  obtain  a 
valid  upper  bound  on  Ap^.  The  details  of  the  proof  can  be  found 
in  Section  V-C-II.  ■ 

Fig.  5(a)  shows  one  realization  of  the  heterogeneous  network 
when  A5  is  slightly  larger  than  Xdvp)  and  A pt  is  small.  At  least 
one  left-to-right  (L-R)  crossing  and  at  least  one  top-to-bottom 
(T-B)  crossing  can  be  found  in  the  square  network.  It  is  thus 
expected  that  these  L-R  and  T-B  crossings  in  finite  square  re- 
gions can  form  an  infinite  connected  component  in  the  whole 
network  on  [R^.  If  we  slightly  increase  Apt,  then  we  observe 
from  Fig.  5(b)  that  the  reduction  in  spectrum  opportunities  elim- 
inates numerous  communication  links  in  the  secondary  network, 
creating  several  disjoint  small  components. 

Fig.  6  shows  a  simulation  example  of  the  connectivity  region, 
where  the  upper  bound  on  the  critical  density  App  of  the  pri- 
mary transmitters  given  in  T2.2  appears  to  be  achievable. 

C.  A  Necessary  Condition  for  Connectivity 

In  this  subsection,  we  establish  a  necessary  condition  for  con- 
nectivity which  is  given  in  terms  of  the  average  conditional  de- 
gree of  a  secondary  user.  This  condition  agrees  with  our  intu- 
ition: the  secondary  network  cannot  be  connected  if  the  degree 
of  every  secondary  user  is  small. 

Let  D  ( A,  (i,  rx/tx)  denote  the  event  that  there  exist  primary  re- 
ceivers/transmitters within  distance  dofa.  secondary  user  A.  Let 
D(A,  (i,  rx/tx)  denote  the  complement  of       fi,  rx/tx).  Since  a 
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Fig.  6.  Simulated  connectivity  regions  when  =  150  m,  rj  —  240  m, 
—  100  m  and  Rj  —  120m.  The  blue  dashed  line  is  the  upper  bound  (4)  on 
the  critical  density  Ap^  of  primary  transmitters  given  in  T2.2.  The  area  of  the 
simulated  heterogeneous  network  is  2000m  x  2000m.  For  a  fixed  density 
of  the  secondary  users,  the  upper  boundary  Ap^(As)  is  equal  to  the  minimum 
density  of  the  primary  transmitters  such  that  over  all  the  1000  realizations,  the 
percentage  of  the  ones  in  which  there  exists  at  least  one  L-R  crossing  is  below 
50%.  The  intuitive  reason  for  choosing  the  existence  of  an  L-R  crossing  as  the 
criterion  for  connectivity  is  illustrated  in  Fig.  5  and  is  discussed  in  Section  V-C. 

secondary  user  is  isolated  if  it  does  not  see  a  spectrum  opportu- 
nity, we  focus  on  secondary  users  who  experience  spectrum  op- 
portunities and  define  the  conditional  average  degree  ii  of  such 
a  secondary  user  A  as 

^  =  E[deg{A)\  l(A,r/,rx)  n  l(A,i?7,tx)]  (5) 

where  deg{A)  denotes  the  degree  of  A,  rj  the  interference  range 
of  the  secondary  users  and  Rj  the  interference  range  of  the  pri- 
mary users.  Notice  that  the  degree  of  A  is  the  number  of  sec- 
ondary users  within  the  transmission  range  of  A  and  experi- 
encing bidirectional  opportunities  with  A.  We  arrive  at  the  fol- 
lowing necessary  condition  for  connectivity. 

Theorem  3:  A  necessary  condition  for  the  connectivity  of 
5(^5,  Apt)  is  /i  >  1,  where  /x  is  the  conditional  average  degree 
of  a  secondary  user  defined  in  (5). 

Proof:  The  basic  idea  is  to  construct  a  branching  process, 
where  the  conditional  average  degree  ^  is  the  average  number  of 
offspring.  This  branching  process  provides  an  upper  bound  on 
the  number  of  secondary  users  in  a  connected  component.  If  jj.  < 
1,  then  the  branching  process  is  finite  a.s.  It  thus  follows  that 
there  is  no  infinite  connected  component  a.s.  in  Q{Xsj  ^pt)- 
Details  can  be  found  in  Section  V-D.  ■ 

To  apply  the  necessary  condition  given  in  Theorem  3,  the 
conditional  average  degree  /i  of  a  secondary  user  A  needs  to 
be  evaluated  based  on  the  network  parameters.  Let  5  be  a  sec- 
ondary user  randomly  and  uniformly  distributed  within  trans- 
mission range  of  A.  Let  giXpT^'^^py'^iy  Rp^  Ri)  denote  the 
probability  of  a  bidirectional  opportunity  between  A  and  B  con- 
ditioned on  the  event  that  A  sees  an  opportunity.  Based  on  the 
statistical  equivalence  and  independence  of  different  points  in 
a  Poisson  point  process,  the  conditional  average  degree  /it  of  a 
secondary  user  A  is  given  by  this  conditional  probability  ^(•)  of 
a  bidirectional  opportunity  between  A  and  a  randomly  chosen 


neighbor  multiplied  by  the  average  number  of  neighbors  of  A, 
i.e., 

/I  =  (A^Trr^)  •  ^(ApT,  r^,  r/,  Rp,  Rj).  (6) 

The  detailed  derivation  of  (6)  and  an  expression  for  g{')  are 
given  in  Appendix  A.  It  is  also  shown  in  Appendix  A  that  ^(•)  is 
a  strictly  decreasing  function  of  Apt-  Thus  ^~^(-),  the  inverse 
of  ^(•)  with  respect  to  Apt,  is  well-defined. 

Combining  (6)  with  Theorem  3,  we  obtain  an  outer  bound  on 
the  connectivity  region.  Specifically,  let  /i(A5,  Apt)  denote  the 
conditional  average  degree  of  a  secondary  user  in  Q{Xs-,  Xpt)- 
Then  those  density  pairs  (A5,  Apt)  satisfying  )tx(A5,  Apt)  <  1 
are  outside  the  connectivity  region. 

Corollary  1:  Given  Rp,  Rj,  Vp  and  r/,  an  outer  bound  on  the 
connectivity  region  C  is  given  by 

Apt  =9~^  (  .  ^  o  ) 

where  g~^{')  is  the  inverse  of  the  conditional  probability  g{') 
with  respect  to  App. 

D.  A  Sufficient  Condition  for  Connectivity 

In  this  subsection,  we  establish  a  sufficient  condition  for  con- 
nectivity, which  provides  an  inner  bound  on  the  connectivity  re- 
gion and  a  criterion  for  checking  whether  a  secondary  network 
is  connected. 

Our  sufficient  condition  for  connectivity  is  established  by 
using  the  discretization  technique.  The  continuum  percolation 
model  is  mapped  onto  a  dependent  site-percolation  model  C 
in  the  following  way.  As  illustrated  in  Fig.  7,  we  partition  [R^ 
into  (dashed)  squares  with  side  length  dg  and  locate  a  site  at  the 
center  of  each  square.  Sites  whose  associated  dashed  squares 
share  at  least  one  common  point  are  considered  connected  (as 
illustrated  by  solid  lines  in  Fig.  7).  Thus,  each  site  is  connected 
to  eight  neighbors4  (see  the  eight  neighbors  Oi, . . . ,  Og  of  site 
O  in  Fig.  7).  Let  Bq  be  the  associated  dashed  square  of  O,  then 
O  is  occupied  if  there  exists  in  Bq  at  least  one  secondary  user 
who  sees  an  opportunity. 

Since  the  largest  distance  between  two  points  in  two  neigh- 
boring dashed  squares  is  2\f2ds,  it  follows  that  if  we  set  d^  — 
then  every  pair  of  secondary  users  in  two  neighboring 
dashed  squares  are  within  transmission  range  Vp  of  each  other. 
Based  on  the  definitions  of  occupied  site  in  C  and  communi- 
cation link  in  the  secondary  network,  we  conclude  that  the  ex- 
istence of  an  infinite  occupied  component  (a  connected  compo- 
nent consisting  of  only  occupied  sites)  in  C  implies  the  existence 
of  an  infinite  connected  component  in  the  secondary  network. 

Due  to  the  fact  that  spectrum  opportunities  are  spatially  de- 
pendent, the  state  of  one  site  is  correlated  with  the  states  of  its 
adjacent  sites.  Thus,  the  above  site-percolation  model  £  is  a  de- 
pendent model.  Define  the  dependence  range  k  as  the  minimum 
distance  such  that  the  state  of  any  two  sites  at  distance  larger 
than  k  are  independent,  where  the  distance  between  two  sites 

^For  the  commonly  used  square  site-percolation  model,  each  site  has  four 
neighbors.  The  site-percolation  model  constructed  here  provides  a  better  inner 
bound. 
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Fig.  7.  Illustration  of  the  dependent  site-percolation  model  C  with  side  length 
ds  (solid  dots  denote  sites,  solid  lines  denote  edges  connecting  every  two  sites 
and  dashed  lines  denote  the  squared  partition). 

is  the  minimum  number  of  neighboring  sites  that  must  be  tra- 
versed from  one  site  to  the  other.  Then  the  dependence  range  of 
L  is  given  by 


=  [8max{i^j  +  ^,rj  +  ^}/r^ 


1. 


(7) 


Let  pc  denote  the  upper  critical  probabiHty  of  C  which  is  de- 
fined as  the  minimum  probabiHty  of  occupancy  p*  such  that  if 
the  probabiHty  of  occupancy  p  >  ,  an  infinite  occupied  com- 
ponent containing  the  origin  exists  in  jC  with  a  positive  prob- 
abiHty (wpp.).  Since  the  dependence  range  A;  of  £  is  finite,  it 
foHows  from  [12,  Theorem  2.3.1]  thatpc  <  1-  Now  we  present 
a  sufficient  condition  for  connectivity  in  the  foHowing  theorem. 

Theorem  4:  Let  pc  denote  the  upper  critical  probability  of  the 
dependent  site-percolation  model  C  specified  above.  Define 


I(r,Rp,ri)  =  2 


ttRI 


(8) 


where  Rp^rj)  is  the  common  area  of  two  circles  with  radii 
Rp  and  r/  and  centers  t  apart.  Then  the  secondary  network  is 
connected  if  the  equation  shown  at  the  bottom  of  the  page  holds. 

Proof:  The  proof  is  based  on  the  ergodicity  of  the  hetero- 
geneous network  model  and  its  relation  with  the  constructed 
dependent  site-percolation  model  C.  Details  can  be  found  in 
Section  V-E.  ■ 
By  applying  a  general  upper  bound  on  the  upper  critical  prob- 
ability Pc  for  a  site-percolation  model  with  finite  dependence 
range  [12,  Theorem  2.3.1],  we  arrive  at  the  following  corollary. 


Corollary  2:  A  sufficient  condition  for  the  connectivity  of 

g{\s,\pT)  is 


In 


1  —  exp 


(-¥) 


I  \R]  +  ,j  -  I(R,,  flp,  r,)]       1  _  (1)1^'+"" 


where  /(i^/,  Rp^  r/)  is  defined  in  (8)  and  k  is  the  dependence 
range  of  the  site-percolation  model  defined  in  (7). 

IV.  Impact  of  Transmission  Power:  Proximity 
Versus  Opportunity 

In  this  section,  we  study  the  impact  of  the  secondary  users' 
transmission  power  on  the  connectivity  and  the  conditional  av- 
erage degree  of  the  secondary  network.  As  has  been  illustrated 
in  Fig.  2,  there  exists  a  tradeoff  between  proximity  and  oppor- 
tunity in  designing  the  secondary  users'  transmission  power  for 
connectivity.  Specifically,  increasing  the  transmission  power  of 
the  secondary  users  leads  to  a  smaller  critical  density  A  J  of  the 
secondary  users,  but  at  the  same  time,  a  lower  tolerance  to  the 
primary  traffic  load  manifested  by  a  smaller  critical  density  Ap^ 
of  the  primary  transmitters. 

A.  Impact  on  the  Conditional  Average  Degree 

As  discussed  in  Section  III-C,  the  expression  for  the  condi- 
tional average  degree  fi  can  be  decomposed  into  the  product 
of  two  terms:  A^Trr^  and  g{\pT^Tp^ri^Rp^Ri).  The  first 
term  is  the  average  number  of  neighbors  of  a  secondary 
user,  which  increases  with  the  transmission  power  pt^  of  the 
secondary  users  (i.e.,  enhanced  proximity).  The  other  term 
g{XpTy  Tp^ri^Rp^Ri)  is  the  conditional  probability  of  a  bidi- 
rectional opportunity,  which  decreases  with  ptx  due  to  reduced 
spectrum  opportunities.  This  tension  between  proximity  and 
opportunity  is  illustrated  in  Fig.  8,  where  we  observe  that 
the  impact  of  ptx  on  proximity  dominates  when  ptx  is  small 
(/X  increases  with  ptx)  while  its  impact  on  the  occurrence  of 
opportunities  dominates  when  ptx  is  large  (/x  decreases  with 

Ptx)- 

Corollary  3:  Let  ptx  be  the  transmission  power  of  secondary 
users  and  the  conditional  average  degree  defined  in  (5),  then 
under  the  disk  signal  propagation  and  interference  model  we 
have^ 


/J^  =  0({ptx)  ^/""^asfe 


where  a  is  the  path-loss  exponent. 

Proof:  We  show  this  corollary  by  deriving  an  upper  bound 
on  ^.  Details  can  be  found  in  Appendix  B.  ■ 

^Here  we  use  the  Big  O  notation:  f{x)  —  0{g{x))  slsx  — ^ooif  and  only  if 
3  M  >  0,  xo  >  0  such  that  \f{x)\  <  M\g(x)\  for  all  a:  >  Xq. 


1  —  exp 


exp{-ApT7r  [Rj  +  rj  -  I  (Ri,  Rp.ri)]}  >  pc 
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Fig.  8.  Conditional  average  degree  (i  of  secondary  users  vs  transmission  range 
Fp  of  secondary  users,  oc  (ptx)^ ,  where  ptx  is  the  transmission  power  of 
secondary  users  and  a  is  the  path-loss  exponent  and  simulation  parameters  are 
given  by  Apt  =  2.5  km-^,  =  200  m,  Ri  =  250  m,  Xs  =  25  km-^, 
rj  =  rp/O.S. 

For  a  homogeneous  network,  the  average  degree  of  a  user  is 
XiTTp,  which  increases  with  ptx  at  rate  {ptx)^^^ •  In  sharp  con- 
trast, this  corollary  tells  us  that  for  a  heterogeneous  network, 
when  is  large  enough,  the  conditional  average  degree  /x  of 
a  secondary  user  actually  decreases  with  ptx  at  least  as  fast  as 

B.  Impact  on  the  Connectivity  Region 

From  the  scaling  relation  of  the  critical  density  [15,  Propo- 
sition 2.11],  we  know  that  in  a  homogeneous  two-dimensional 
network 

Ac(rp)  =  Ac(l)  (r^)"^  oc  {ptx)~^ 

where  the  constant  Ac(l)  is  the  critical  density  for  a  homoge- 
neous ad  hoc  network  with  a  unit  transmission  range.  Thus, 
if  each  secondary  user  adopts  a  high  transmission  power,  then 
Ac(^p)  decreases.  It  follows  from  T2.1  that  the  critical  den- 
sity A  J  of  secondary  users  required  to  achieve  connectivity  de- 
creases due  to  the  enhanced  proximity. 

On  the  other  hand,  from  the  upper  bound  on  the  critical  den- 
sity Ap^  of  the  primary  transmitters  given  in  T2.2,  we  have  that 

=  O  (^(fe)"^/"")  as  Ptx  oo 

where  we  have  assumed  that  =  j3rj  for  some  /?  G  (0, 1) 
under  the  disk  signal  propagation  and  interference  model.  Thus, 
when  the  transmission  power  ptx  of  the  secondary  network  is 
large  enough,  the  critical  density  Ap^  of  the  primary  transmit- 
ters decreases  with  ptx  at  least  as  fast  as  {ptx)~'^^^  due  to  re- 
duced spectrum  opportunities. 

C.  Optimal  Design  of  Transmission  Power 

Due  to  the  tension  between  proximity  and  opportunity,  there 
does  not  exist  a  transmission  power  of  the  secondary  users  that 
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Fig.  9.  Example  of  the  upper  bound  on  Ap^  as  a  function  of  rj  (parameters 
are  given  by  Rj  —  120  m,      =  0.625rj). 

leads  to  the  "largest"  connectivity  region  (largest  in  the  sense 
that  its  connectivity  region  contains  all  regions  achievable  with 
any  finite  transmission  power  p^^^  of  the  secondary  users).  Thus, 
the  optimal  design  of  ptx  depends  on  the  operating  point  of  the 
heterogeneous  network.  For  instance,  when  a  sparse  secondary 
network  is  overlaid  with  a  primary  network  with  low  traffic  load, 
a  large  ptx  may  be  desirable  to  achieve  connectivity.  The  oppo- 
site holds  when  a  dense  secondary  network  is  overlaid  with  a 
primary  network  with  high  traffic  load. 

Focusing  on  a  secondary  network  whose  density  exceeds  the 
critical  density  of  the  secondary  users,  we  address  the  design 
of  its  transmission  power  for  the  maximum  tolerance  to  the  pri- 
mary traffic.  Due  to  its  tractability  and  achievability,  the  upper 
bound  on  the  critical  density  Ap^  of  the  primary  transmitters 
given  in  T2.2  is  used  as  the  performance  measure  (see  Fig.  6  for 
a  simulation  example). 

Theorem  5:  Let  r/  and  Ri  denote  the  interference  range  of 
the  secondary  and  the  primary  users.  For  a  fixed  Rj,  the  upper 
bound  on  App  given  in  T2.2  is  maximized  when  the  primary  and 
secondary  networks  have  matching  interference  ranges:  rj  = 
Rj. 

Proof:  Since  under  the  disk  signal  propagation  and  inter- 
ference model,  rp  =  prj  for  some  p  G  (0, 1),  the  upper  bound 
on  App  can  be  written  as 

^^^/i^^  forr,<i?, 
[  X^^pj^     forr/  >  Ri. 

Then  the  above  theorem  can  be  readily  shown  by  finding  the 
maximal  point  for  the  two  cases:  rj  <  Rj  and  rj  >  Rj.  ■ 

An  example  of  the  upper  bound  on  App  is  plotted  as  a  func- 
tion of  rj  in  Fig.  9.  Notice  that  there  is  a  distinct  difference 
in  the  slope  on  the  two  sides  of  the  optimal  point.  As  a  conse- 
quence, the  operating  region  of  rj  <  Rj  is  preferred  over  that  of 
rj  >  Rj  when  the  optimal  point  r/  =  Rj  cannot  be  achieved. 
We  point  out  that  the  desired  operating  region  of  rj  <  Rj  is  the 
typical  case  of  a  secondary  network  coexisting  with  a  privileged 
primary  network. 
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V.  Proofs 

In  this  section,  we  present  proofs  of  the  main  results  presented 
in  Sections  III-IV.  We  start  with  a  brief  overview  of  several 
basic  results  in  percolation  theory  that  will  be  used  in  the  proofs. 

A.  Percolation  Theory:  Some  Basic  Results 

Poisson  Boolean  Model:  Poisson  Boolean  model  [15],  often 
referred  to  as  B{X^  p,  A),  is  specified  by  two  elements:  a 
Poisson  point  process  X  on  with  density  A  and  a  radius  p, 
a  random  variable  with  a  given  distribution.  Under  this  model, 
each  point  in  X  is  the  center  of  a  ball  in  with  a  random 
radius  distributed  according  to  the  distribution  of  p.  Radii  asso- 
ciated with  different  points  are  independent  and  they  are  also 
independent  of  points  in  X.  Under  a  Poisson  Boolean  model, 
the  whole  space  is  partitioned  into  two  regions:  the  occupied 
region,  which  is  the  region  covered  by  at  least  one  ball  and  the 
vacant  region,  which  is  the  complement  of  the  occupied  region. 
We  define  occupied  (vacant)  components  as  those  connected 
components  in  the  occupied  (vacant)  region. 

Assume  that  nodes  in  a  homogeneous  ad  hoc  network  form 
a  Poisson  point  process  with  density  A  and  their  transmission 
range  is  r.  It  is  easy  to  see  that  the  connectivity  of  this  network 
can  be  studied  through  examination  of  the  occupied  connected 
components  in  the  corresponding  Poisson  Boolean  model 
B{X,  r/2,  A). 

Sharp  Transition  in  Two  Dimensions:  For  the  Poisson 
Boolean  model  in  two  dimensions,  the  sharp  transition  phe- 
nomenon is  remarkable  in  the  sense  that  the  critical  density 
for  the  a.s.  existence  of  infinite  occupied  components  is  equal 
to  that  for  the  a.s.  existence  of  infinite  vacant  components. 
Let  Ac(2p)  denote  the  critical  density  for  the  Poisson  Boolean 
model  B{X^  p,  A),  then  we  have  that: 

•  when  A  <  Ac(2p),  there  is  no  infinite  occupied  component 
a.s.  and  there  is  a  unique  infinite  vacant  component  a.s.; 

•  when  A  >  Ac(2p),  there  is  a  unique  infinite  occupied  com- 
ponent a.s.  and  there  is  no  infinite  vacant  component  a.s. 

The  exact  value  of  Ac  is  not  known.  For  a  deterministic  radius  p, 
simulation  results  [21]  indicate  that  Ac(2p)  ^  0.36p~^,  while 
rigorous  bounds  0.192p~^  <  Ac(2p)  <  0.843p~^  are  provided 
in  [15]  and  [22]. 

Crossing  Probabilities:  A  continuous  curve  in  the  occupied 
region  is  called  an  occupied  path.  An  occupied  path  7  is  an  oc- 
cupied L-R  crossing  of  the  rectangle  {0  <  x  <  h}  x  {0  < 
y  <  ^2}  if  7  intersects  both  the  left  and  the  right  boundaries 
of  the  rectangle,  i.e.,  7  fi  {{x  =  0}  x  {0  <  y  <  I2})  ^  0, 
7  n  {{x  =  h}  X  {0  <  y  <  I2})  7^  (p  and  the  segment  between 
the  two  intersecting  points  is  fully  contained  in  the  rectangle. 
Similarly,  we  define  an  occupied  T-B  crossing  by  requiring  that 
7  intersects  with  the  top  and  bottom  boundaries  of  the  rectangle. 
Let 

a((/i,/2),  A,  L-R) 

=  Pr{3  an  occupied  L  —  R  crossing  of  [0,  li]  x  [0, 12]} 
cj{{hM).  A,  T-B) 

=  Pr{3  an  occupied  T-B  crossing  of  [0,  /i]  x  [0, 12]} 


{\S2^  ApT2) 
# 
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(A52,  ApTl) 
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{\s2i  ApTl) 
 <D 


{\s2i  ApT2) 
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<  Ap 
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>  A^ 


Ac 


Fig.  10.  Continuous  path  connecting  the  two  points  {Xsi^^pti) 
and  (A52,ApT2)  in  the  connectivity  region  C.  (a)  Appi  <  Xpt2- 
(b)  Appi  >  App2- 


denote  the  two  crossing  probabilities  in  the  rectangle  [0,  /i]  x 
[0, 12]-  Then  for  a  Poisson  Boolean  model  B{X^  p,  A)  in  two 
dimensions  with  a.s.  bounded  p,  we  have  [15,  Corollary  4.1] 
that  for  any  k  >  1 

^lim^.pn,n),A,L-R)  =  {;;    ^^^^^  (9) 

Due  to  the  symmetry  of  the  Poisson  Boolean  model, 
similar   results   hold   for   the   T-B    crossing  probability 

a{{n,kn),  A,  T-B). 


B.  Proof  of  Theorem  1 

Proof  of  Tl.l:  To  prove  Tl.l,  it  suffices  to  show  that  for  any 
two  given  points  (A51,  Apti)  and  (A52,  ^pt2)  inC,  we  can  find 
a  path  in  C  that  connects  these  two  points.  In  particular,  the  path 
we  constructed  is  given  by  a  horizontal  segment  and  a  vertical 
segment  as  shown  in  Fig.  10,  where  we  assume,  without  loss  of 
generality,  that  A51  <  A52- 

Consider  case  (a)  in  Fig.  10  where  \pti  <  ^pt2-  Case 
(b)  can  be  proven  similarly.  First,  we  show  every  point 
{^s^^PTi)  <  A5  <  A52)  on  the  horizontal  segment  be- 
longs to  C.  Let  A'  =  A5  —  A51.  A  Poisson  point  process  X  with 
density  A5  is  statistically  equivalent  to  the  superposition  of  a 
Poisson  point  process  Xi  with  density  A51  and  an  independent 
Poisson  point  process  X'  with  density  A'.  It  follows  that  any 
realization  of  the  heterogeneous  network  with  densities  A 5  and 
XpTi  can  be  generated  by  adding  more  secondary  nodes  to  a 
realization  of  the  heterogeneous  network  with  densities  A51  and 
XpTi  -  Thus,  the  existence  of  an  infinite  connected  component 
in  ^(Asi,  ApTi)  implies  the  existence  of  an  infinite  connected 
component  in  ^(A^,  Apti)-  We  thus  have  that  (A5,  Apti)  G  C 
for  (A51  <  A5  <  A52). 

Now  we  know  that  the  two  end  points  (A52,Apti)  and 
(A52,  ApT2)  of  the  vertical  segment  belong  to  C.  For  a  point 
(A52,App)  (Appi  <  App  <  App2)  on  the  vertical  segment, 
let  A'  =  ApT2  —  XpT,  then  any  realization  of  the  heteroge- 
neous network  with  densities  A52  and  App  can  be  obtained 
by  independently  removing  each  primary  transmitter-receiver 
pair  with  probability  \' /\pT2  from  a  realization  of  the  het- 
erogeneous network  with  densities  A52  and  App2.  It  follows 
from  the  definition  of  communication  link  in  the  secondary 
network  (see  Section  II-B-I)  that  the  existence  of  an  infinite 
connected  component  in  Q{\s2^XpT2)  implies  the  existence 
of  an  infinite  connected  component  in  ^(As,  App).  Thus,  we 
have  (A52,App)  G  C  {Xppi  <  App  <  App2). 


63 


Proof  of  Theorem  1.2:  Suppose  that  (A5,  Apt)  ^  C  {XpT  > 
0) ,  then  by  using  the  coupHng  argument  for  showing  that  the  ver- 
tical segment  belongs  to  C  in  the  above  proof  of  T  1.1,  we  con- 
clude that  (A5,  0)  G  C,  i.e.,  the  A^-axis  is  the  lower  boundary 
of  C. 

Suppose  that  A52  >  A51  >  0.  In  order  to  prove  the  mono- 
tonicity  of  Ap^(A5)  with  A 5  it  suffices  to  show  that  V  Apt  >  0, 
if  (A51,  Apt)  G  C  then  (A52,  Apt)  G  C.  This  is  a  direct  conse- 
quence of  the  coupling  argument  for  showing  that  the  horizontal 
segment  belongs  to  C  in  the  above  proof  of  T  1.1. 

Proof  of  Theorem  1.3:  We  first  establish  the  ergodicity^  of 
the  heterogeneous  network  model. 

Lemma  1:  The  heterogeneous  network  model  is  ergodic. 
Proof  of  Lemma  1:  The  proof  of  this  lemma  is  inspired 
by  the  proof  of  the  ergodicity  of  the  Poisson  Boolean  model 
[15,  Proposition  2.8].  The  difficulty  here  is  that  for  the  hetero- 
geneous network  model,  we  have  two  correlated  Poisson  point 
processes:  the  primary  transmitters  and  the  primary  receivers. 
The  definition  of  the  shift  transformation  for  the  primary  net- 
work model  is  thus  more  complicated  than  the  standard  Poisson 
Boolean  model  with  a  deterministic  radius  p.  To  prove  Lemma 
1,  we  first  show  the  ergodicity  of  the  primary  network  model 
and  then  we  show  the  mixing  property^  of  the  secondary  net- 
work model.  Since  the  primary  network  model  is  independent 
of  the  secondary  network  model,  it  follows  from  [24,  Theorem 
2.6.1]  that  the  heterogeneous  network  model  is  ergodic. 

d  , 

Let       denote  the  Borel  cr-algebra  in     and  TV  the  set  of  all 

d 

simple  counting  measures^  on  ^   .  Construct  a  cr-algebra 
for  N  generated  by  sets  of  the  form 

{neN  :  n(A)  =  k} 

d 

where  A  e  and  k  is  an  integer.  A  point  process  X  can  now 
be  defined  as  a  measurable  mapping  from  a  probability  space 
{n,  ^  ,  P)  into  (TV,  jV)  [23,  Chapter  7].  The  measure  \i  on 
JV  induced  by  X  is  defined  as  ^(G)  =  F[X-^[G)),  for  all 
G  e  J". 

In  order  to  define  the  shift  transformation  on  il,  it  is  conve- 
nient to  identify  (Q,  ^  )  with  (TV,  jV).  Let  u;(A)  denote  the 

d 

number  of  points  in  A  E  ,Vct;G^l  and  Tx  be  the  shift 
according  to  a  vector  x  eU^.  Then  Tx  induces  a  shift  transfor- 
mation Sx  :  — ^     through  the  following  equation  for  every 

{SxUj){A)=uj{T-'A).  (10) 

measure  preserving  (m.p.)  dynamical  system  (O,  ^  ,  ji,  T)  consists  of 

a  set  ri,  a  cr -algebra  ^  of  measurable  subsets  of  O,  a  nonnegative  measure 

//  on        ^  )  and  an  invertible  m.p.  transformation  T  :  Q  —^Q  such  that 

^i{T-^F)  =  ^{F)  for  all     e  ^  .  A  set  F  G  ^  is  said  to  be  T-invariant  if 

T~^F  —  F.  The  m.p.  dynamical  system  ^  ,  fi,  T)  is  said  to  be  ergodic 
if  the  (7-algebra  of  T-invariant  sets  is  trivial,  i.e.,  for  any  invariant  set,  either  it 
has  measure  0  or  its  complement  has  measure  0. 

^An  m.p.  dynamical  system        ^  ,  /w,  T)  is  said  to  be  mixing  if  for  all 

E,F  e  ^  ,    Imi  n{T^E  f]  F)  =  n{E)n{F). 

d 

simple  counting  measure  on  ^  is  an  integer-valued  measure  for  which 
the  measures  of  bounded  Borel  sets  are  all  finite  and  the  measure  of  a  point  is 
at  most  1 . 


Let  (llpT,  ^  PTy  Ppt)  be  the  probability  space  of 
the  Poisson  point  process  Xpt  for  the  primary  transmit- 
ters with  density  Apt-  Let  VLpr  be  the  product  space 
rinGN  Ylzez^  ^Rp  primary    receivers,  where 

Cr^  =  {{x^y)  :  x'^  -\-  y'^  <  Rp}.  Then  we  equip  ftpR  with 
the  usual  product  cr-algebra  and  product  measure  PpR  with  all 
marginal  probability  measure  being  where  /j^u  is  a  uniform 
probability  measure  on  Cr^  .  Finally,  we  set  ^^p  =  ^pt  x^pr 
and  equip  p  with  the  product  measure  Pp  =  Ppp  x  PpR  and 
the  usual  product  cr-algebra.  It  follows  that  the  primary  network 
model  is  a  measurable  mapping  from  Q^p  into  A^pt  x  ^pr 
defined  by  {cupTy  ^pr)  —^{Xpt{oopt)^  ^pr)^  where  Npp  is 
specified  in  the  definition  of  the  point  process. 

The  positions  of  the  primary  transmitters  corresponding  to 
((jpT,  ojpr)  G  ^IpT  X  ^PR  are  easily  known  from  upp- 
For  the  primary  receivers,  the  positions  are  obtained  as  follows. 
Consider  binary  cubes 

2 

K{n,z)  :=  ]^(^i2-^,(^,  +  l)2-^]for  alln  G  Nand^  G  Z^. 

i=i 

For  each  primary  transmitter  x pp,  there  exists  a  unique  smallest 
integer  no  =  no(x pt)  such  that  it  is  contained  in  a  binary  cube 
K(no^  z{no^  xpp))  which  contains  no  other  primary  transmit- 
ters. The  relative  position  of  xpp '  s  receiver  with  respect  to  xpp 
is  then  given  by  ujpR(no,  z{no,  xpp)). 

Let  ei,  e2  denote  the  unit  vectors  in  [R^,  then  the  translation 
'.U'^  -^U'^  {i  =  l,2)  defined  hyx-^x  +  Ci  induces  a  shift 
transformation  U^-  on  ^pr  through  the  equation 

{Ue,u)pR){n,  z)  =  ujpR{n,  z  -  2'' a),  for  ^  =  1,  2. 

Hence  T^-  also  induces  a  shift  transformation  Tg^  on  lip  = 
^7pT  X  Q Pi?  as  follows: 

fe^UJp)  =  (Se^UJpTj  Ue^UJpR),  fOY  i  =  1,2 

where  5'e-  is  defined  in  (10).  By  using  techniques  similar  to  the 
proof  of  Boolean  models  [15,  Proposition  2.8],  we  have  that  the 
m.p.  dynamical  system  (ilp,  ^  p,  Pp,  TeJ  is  ergodic. 

Since  the  transmission  range  of  secondary  users  is  fixed, 
the  probability  space  of  the  secondary  network  model  is  the 
probability  space  (^1^,  ^5,  Ps)  for  the  Poisson  point  process 
X5  of  secondary  users  with  density  A5.  It  follows  from  the 
proof  of  Poisson  point  processes  [15,  Proposition  2.6]  that  the 
m.p.  dynamical  system  (^1^,  ^5,  P5,  Se^)  is  mixing. 

Since  the  primary  network  model  is  independent  of  the  sec- 
ondary network  model,  the  sample  space  of  the  heterogeneous 
network  model  ^1  can  be  written  as  the  product  of  ^lp  and 
i.e.,  =  Qp  X  Qs-  We  equip  with  product  measure  P  = 
Pp  X  Ps  and  the  usual  product  cr-algebra.  Similarly,  the  trans- 
lation Tg.  (i  =  1,2)  induces  a  transformation  T^-  on  ft  = 
Q.p  X  Q5,  which  is  given  by 

Te^{u)  =  (fe^CxJp.Se^UJs)' 

Then  it  follows  from  [24,  Theorem  2.6.1]  that  the  product  m.p. 
dynamical  system  (11,  ^  ,  P,  TeJ  is  ergodic.  Since  the  cr-al- 
gebra  invariant  under  the  transformation  group  {T^  :  z  G  Z^} 
is  a  subset  of  the  cr-algebra  invariant  under  the  transformation 
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Te^,  we  conclude  that  {Tz  :  z  e  Z'^}  acts  ergodically,  i.e.,  the 
heterogeneous  network  model  is  ergodic.  ■ 
Based  on  Lemma  1,  we  have  the  following  lemma. 

Lemma  2:  The  number  of  infinite  connected  component  in 
5(^5,  Apt)  is  a  constant  a.s.  and  it  can  only  take  value  from 
{0,  1,  oo}. 

Proof  of  Lemma  2:  Let  K  denote  the  (random)  number  of 
infinite  connected  components  in  ^(A^,  Apt),  then  since  for  all 
k  >  0,  the  event  {K  =  k}  is  invariant  under  the  group  of  shift 
transformations,  it  follows  from  Lemma  1  and  the  definition  of 
ergodicity  that  Fy{K  =  /c}  =  0  or  1.  Consequently,  we  have 
that  K  is  a.s.  constant.  Then  it  suffices  to  exclude  the  possibility 
ofK  >  2.  This  is  shown  by  contradiction,  that  is,  if  there  exist 
K  >  2  infinite  connected  components,  then  they  can  be  linked 
together  as  one  connected  component  wpp.  The  proof  is  inspired 
by  the  proof  of  Proposition  3.3  in  [15]  and  a  major  difference  is 
that  here  we  need  to  consider  the  impact  of  the  primary  network 
on  the  connectivity  of  the  secondary  network. 

Suppose  that  there  are  >  2  infinite  connected  components 
a.s.  If  we  remove  all  the  secondary  nodes  centered  inside  a  box 
B  =  [— n,n]^,  then  the  resulting  secondary  network  should 
contain  at  least  K  unbounded  components  a.s.  Let,  for  C  IR^, 
G[A]  denote  the  graph  formed  by  secondary  nodes  in  A.  Given 
a  box  B  and  e  >  0,  consider  the  event  E{B^  e)  shown  at  the 
bottom  of  the  page. 

Partition  the  box  B  into  squares  with  side  length  a  >  0 
and  let  <Sa  =  {aS'i,  . . . ,  Sn}  denote  the  collection  of  all  the 
squares  which  are  adjacent  to  the  boundary  of  B.  Clearly,  for 
a  box  B  and  e  >  0,  we  can  find  a  =  a{B^e)  G  (0,rp/\/5) 
and  T]  =  r]{a)  >  0  such  that  for  any  point  x  ^  B  with 
d{x^B)  <  Vp  —  e/2,  there  exists  a  square  S  =  S{x)  G  Sa 
for  which  we  have  sup^^^  d{x^  y)  ^  Tp  —  rj.  This  means  that, 
if  we  center  in  each  square  of  Sa  a  secondary  node  and  there 
are  neither  primary  transmitters  nor  primary  receivers  within  a 
bigger  box  5  =  [— n  —  max{r/,  i^/},  n  +  max{r/,  i?/}]^,  then 
every  infinite  component  U  in  Q[B^]  with  d{U^  B)  <  —  e  is 
connected  to  some  secondary  node  in  Sa- 

Let  E{a^r])  be  the  event  that  each  square  in  Sa  contains  at 
least  one  secondary  node  and  E{B)  the  event  that  there  are  nei- 
ther primary  transmitters  nor  primary  receivers  within  B.  Since 
E{a^r])  depends  on  the  configuration  of  secondary  nodes  inside 
the  box  B,  E{B^e)  depends  on  the  configuration  of  secondary 
nodes  outside  B  and  the  configuration  of  primary  nodes,  based 
on  the  independence  of  the  primary  network  and  the  secondary 
network,  we  have 


Fy{E(B,€)  n  E{a,r])  n  E{B)) 

=  Ft{E{B,  6))Pr(^(a,  v))Ft{E{B)\E{B,  e)). 


If  E{B^  e),  E{a^  rf)  and  E{B)  all  occur  wpp.,  then  there  is 
only  one  infinite  connected  component^  wpp.  By  using  argu- 
ments similar  to  the  proof  for  [15,  Proposition  3.3],  we  have 
that  there  exists  a  large  enough  box  B  and  e  >  0  such  that 
FT{E{B,e))  >  0.  Obviously,  Pr(^(a,7y))  >  0.  Moreover,  it 
iseasy  toseethatP(£;(5)|E(B,e))  >  P{E{B))  >  0.  ■ 

Now  we  have  that  the  number  K  of  infinite  connected  com- 
ponents is  equal  to  zero,  one  or  infinity  a.s.  To  exclude  the  pos- 
sibility of  K  =  oc,  we  can  use  the  proof  of  [15,  Theorem  3.6] 
for  Poisson  Boolean  models,  which  is  based  on  several  combi- 
natorial results.  The  details  are  omitted. 

C.  Proof  of  Theorem  2 

Proof  of  T2.1:  To  prove  T2.1,  it  suffices  to  show  that: 

(a)  for  any  A5  <  Ac(rp),  the  secondary  network  is  not  con- 
nected for  any  Apt  >  0; 

(b)  for  any  A5  >  Ac(rp),  there  exists  a  X^prp{\s)  >  0 
such  that  V  Apt  <  Ap^(A5),  the  secondary  network  is 
connected. 

From  Section  IV-A,  we  know  that  for  a  Poisson  homogeneous 
ad  hoc  network  with  density  A  and  transmission  range  r,  the  nec- 
essary and  sufficient  condition  for  connectivity  is  A  >  Ac(r). 
Since  the  existence  of  an  infinite  connected  component  in  the 
secondary  network  implies  the  existence  of  an  infinite  connected 
component  in  the  homogeneous  ad  hoc  network  with  the  same 
density  and  the  same  transmission  range,  by  using  a  coupling 
argument,  we  conclude  that  when  A5  <  Ac(rp),  there  does  not 
exist  an  infinite  connected  component  a.s.  in  the  secondary  net- 
work for  any  Apt  >  0.  This  proves  part  (a). 

The  basic  idea  of  the  proof  of  part  (b)  is  to  approximate  the 
secondary  network  ^(A^,  Apt)  by  a  discrete  dependent  edge- 
percolation  model  on  the  grid.  This  discrete  dependent  edge- 
percolation  model  C  is  constructed  in  a  way  such  that  the  exis- 
tence of  an  infinite  connected  component  in  jC  implies  the  exis- 
tence of  an  infinite  connected  component  in  ^(As,  Apt). 

Construct  the  square  lattice  £  on  IR^  with  side  length  d^.  Note 
that  each  site  in  C  is  virtual  and  is  not  related  to  any  node  either 
in  the  secondary  network  or  in  the  primary  network.  Next  we 
specify  the  conditions  for  an  edge  being  open  in  £,  which  is  the 
key  to  the  mapping  from  Q{Xsy  Apt)  to  jC. 

For  each  edge  e  in  £,  let  {x^ ,  He)  denote  the  middle  point  of  e. 
Then  we  introduce  three  random  fields  A^,  Be  and  Ce,  all  asso- 
ciated with  the  edge  e  in  £,  where  Ce  =  A^B^  is  the  indicator 
of  the  edge  e  being  open,  A^  represents  the  condition  (CI)  of 
the  distance  between  two  users  for  the  existence  of  a  commu- 
nication link  in  the  secondary  network  and  B^  represents  the 
condition  (C2)  of  the  spectrum  opportunity.  Specifically,  con- 
sider the  Poisson  Boolean  model  B{Xsy  rp/2^  A5)  where  Xs 

^Since  a  <  r^/  every  secondary  node  in  a  square  of  Sa  is  connected  to 
those  secondary  nodes  in  the  neighboring  squares. 


E{B^  e)  :=  {d{U^  B)  <  Vp  —  e  for  any  infinite  connected  component  U  in  5 [5^]} 
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Fig.  11.  Realization  where  —  1  for  the  edge  e  (hollow  points  are  sites  in 
C  and  solid  segments  are  edges  in  £). 


is  the  Poisson  point  process  generated  by  secondary  users,  then 
for  a  horizontal  edge  e,  A^,  —  1  if  the  following  two  events  (il- 
lustrated in  Fig.  11)  are  true: 

(i)  there  is  an  occupied  L-R  crossing  of  the  rectangle 

-  34/4,  +  34/4]  X  [ye  -  4/4,  Ve  +  4/4]  in 
B[Xs,  r^/2,  \s)\ 

(ii)  there  are  two  occupied  T-B  crossings  of  the  square  [a^e  — 
34/4,  :re  — 4/4]  X  [^e  — 4/4,  ye  + 4/4]  and  the  square 

+  4/4,  x^  +  34/4]  X  -  4/4,  Ve  +  4/4]  in 
BiXs-,  Tj)/2^  Xs)  \  and  =  0  otherwise.  For  a  vertical 
edge  e,  the  definition  of  Ae  is  similar,  where  the  horizontal 
and  vertical  coordinates  are  switched  in  the  above  two 
events. 

Next,  we  define  the  random  field  B^.  For  an  edge  e  in  £, 
Be  =  lif  Ae  =  I  and  the  following  two  events  are  true: 

(i)  there  is  no  primary  transmitter  within  distance  Rj  of  any 
secondary  node  of  the  three  crossings  in  the  definition  of 

Ae-, 

(ii)  there  is  no  primary  receiver  within  distance  rj  of  any  sec- 
ondary node  of  the  three  crossings  in  the  definition  of  A^ ; 

and  Be  =  0  otherwise.  It  follows  from  the  definition  of  com- 
munication link  in  the  secondary  network  (see  Section  II-B-I) 
that  if  =  1  and  Be  =  I,  then  the  three  crossings  in 
B{Xsj  \s)  are  also  three  crossings  in  ^(A^,  Apt)- 

Let  Ce  —  AeBe,  then  we  claim  that  the  edge  e  is  open  if 
Ce  =  I  and  e  is  closed  if  Ce  =  0.  We  observe  from  Fig.  1 1 
that  whether  the  edge  e  is  open  is  correlated  with  the  states  of 
the  other  edges.  This  model  jC  thus  is  a  dependent  edge-perco- 
lation model.  Furthermore,  as  shown  in  Fig.  12,  if  there  exists 
an  infinite  open  connected  component  in  £,  then  those  cross- 
ings associated  with  the  edges  in  the  infinite  component  in  C 
comprise  an  infinite  connected  component  in  Q{Xs^  Apt)-  As  a 
consequence,  by  considering  the  uniqueness  of  the  infinite  con- 
nected component  in  ^(As,  Apt),  we  only  need  to  prove  the 
following  lemma  in  order  to  show  T2.1. 

Lemma  3:  Let  C{0)  denote  the  open  connected  component 
containing  the  origin  O  in  C  Then  given  Xs  >  Xc{rp),  3  D  > 
0,  X%rr  >  0  such  that  for  de  =  D  and  any  Apt  <  Ap^,  we  have 


vpT  >  0  such  that  for  de  =  D  and  any  Apt  <  A  J 
Pr{|C(0)|  =  oc}  >  0 
where  |C(0)|  is  the  number  of  edges  in  C{0). 


Fig.  12.  Percolation  in  C  (thick  segments  are  open  edges  in  C  and  thin  seg- 
ments are  closed  edges  in  C  and  blue  curves  are  those  crossings  associated  with 
the  open  edges). 


Proof  of  Lemma  3:  For  an  arbitrary  edge  e  in  £,  let  g  = 
Pr{Ce  =  0},  then  we  have 

q  =  Pr{(Ae  =  0)  U  {Be  =  0)}  <  Pr{Ae  =  0}  +  Pr{Be  =  0}. 

From  the  result  on  the  crossing  probabilities  given  in  (9),  we 
know  that  when  Xs  >  Xc{rp), 

PrjAe  =  0}  =Pr{at  least  one  crossing  does  not  exist} 
<  \l-a(('^/-^]^Xs.L. 
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►  oc,  i.e.,     lim    Prj^e  =  0}  =  0. 

de   >  OO 


Thus,  when  Xs  >  Xc{rp),  \/e>0,3D>0  such  that  Pr{^e  = 
0}  <  f  for  4  =  D. 

Given  Ae  =  I,  let  Sr^^  be  the  area  of  the  region  covered  by 
the  circles  with  radii  Rj  centered  at  those  secondary  nodes  in 
the  three  crossings  and  Srj  be  the  area  of  the  region  covered  by 
the  circles  with  radii  r/  centered  at  those  secondary  nodes  in  the 
three  crossings.  Then  we  have 

Pr{Be  =  0  I  ^e  =  1} 

=  Pr{3  some  primary  transmitter  in  Sr^} 
+  Pr{3  some  primary  receiver  in  Srj}. 


Since  Srj  < 


f  3de 


+  2Ri  +  rp){^+2Ri  +  rp)  and 


<  -f  2ri  -h  Vp)       -h  2ri  -\-  r^),  it  follows  from 

the  basic  property  of  Poisson  point  processes  that 


Pr{5e  =  0 
<  1  —  exp 

+  1  —  exp 


Ae  =  l} 

—  XpT 

—  XpT 
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Obviously,     lim     Pr{Be  =  0  |       =  1}  =  0  for  fixed  d^. 

A pT  — ^  0 

Thus  if  we  choose  de  —  D,  then  Ve  >  0,  3  Ap^  >  0  such  that 

Pr{Be  =  0  I  .4e  =  1}  <  ^  for  all  Apt  <  A^.^- 
It  implies  that  when  de  =  D,  for  all  Apt  <  ^pt 

Pr{Be  =  0}  =  Fv{Ae  =  0} +Pr{Be  =  0  Me  =  l}Pv{A,  =  1} 

<  Pr{Ae  =  0}  +  Pr{Se  =  0  I      =  1} 
2e 

Thus,  for  de  =  D  and  all  Apt  <  Ap^,  we  have 

q  <  Pr{Ae  =  0}  +  Pr{5e  =  0}  <  e.  (11) 

From  Fig.  1 1 ,  we  can  see  that  if  de  >  max{4i?7  +  2rp ,  4r/  + 
2rp},  then  the  state  of  edge  e  is  only  correlated  with  its  six  ad- 
jacent edges  and  it  is  independent  of  other  edges.  In  this  case, 
by  using  the  "Peierls  argument"  [25,  Chapter  1],  we  can  show 
that  if  the  probability  of  an  edge  being  closed  q  <  ii-2\/Io  ^  ^ 
then 


4  D      ^  Primary  Transmitter 
•  ^^-^    •  Secondary  User 


Fig.  13.  Counterexample  for  choosing  Rpt  —  Ri-  All  the  secondary  nodes 
in  the  infinite  path  are  located  outside  those  circles  centered  at  the  primary 
transmitters  with  radii  Ri ,  which  form  a  series  of  rings  surrounding  the  origin 
O  and  there  is  no  infinite  vacant  component  in  the  Poisson  Boolean  model 
B{XpT,  Ri,  Apt)  driven  by  the  primary  transmitters. 


Fig.  14.  Infinite  path  in  the  secondary  network.  The  dashed  segments  form  an 
inner  bound  on  the  infinite  vacant  component  in  the  Poisson  Boolean  model 
driven  by  the  primary  receivers. 


Pr{|C(O)|  =  oo}>0.  (12) 

The  proof  of  the  above  statement  follows  the  one  of  [7,  Theorem 
3]  except  that  the  upper  bound  on  the  probability  of  n  edges  all 
being  closed  is  given  by  the  following  fact. 

Fact  1:  [8,  Proposition  1]:  For  any  collection  {e^}f^^  of  n 
distinct  edges  in  £,  we  have 

Pr{(Ci  =  0)  n  (C2  =  0)  n . . .  n  (a  =  o)}  < 

where  Ci  is  the  indicator  of  being  closed,  and  q  is  the  prob- 
ability of  an  edge  being  closed. 

Thus,  by  combining  (12)  with  (11),  we  conclude  that  given 
As  >  Ac(rp),  3  D  >  0,  Ap^  >  0  such  that  for  fixed  de  = 
max{D,  ARj  +  2rp,  4r/  +  2rp}  and  any  Apt  <  Ap^. 

Pr{|C(0)|  =  oo}  >  0. 

Notice  that  Ap^  depends  on  D  which  is  chosen  according  to  the 
crossing  probability  and  is  determined  by  A5.  As  a  consequence, 
Ap^  is  a  function  of  A5,  i.e.,  Ap^  =  Ap^(A5).  ■ 
Proof  of  T2.2:  From  the  conditions  for  the  existence  of 
a  communication  link  in  the  secondary  network  specified  in 
Section  II-B-I,  we  know  that  for  every  secondary  node  in  an  in- 
finite connected  component,  there  can  exist  neither  any  primary 
transmitter  within  distance  Rj  of  it  nor  any  primary  receiver 
within  distance  rj  of  it.  In  other  words,  every  secondary  node 
in  an  infinite  connected  component  must  be  located  outside 
all  the  circles  centered  at  the  primary  transmitters  and  the 
primary  receivers  with  radii  Rj  and  r/,  respectively.  Thus, 
if  there  is  an  infinite  connected  component  in  the  secondary 
network,  then  an  infinite  vacant  component  must  exist  in 
the  two  Poisson  Boolean  models  B{XpT^  RpT-,  ^pt)  and 
B{XpR^  RpRy  Apt)  driven  by  the  primary  transmitters  and 
the  primary  receivers,  respectively.  Here  Rpt  and  RpR  are 


some  appropriate  radii  which  will  be  specified  later.  A  natural 
choice  for  Rpp  is  Rj,  but  if  we  consider  the  counterexample 
given  in  Fig.  13,  then  we  can  clearly  see  that  even  if  there  is  an 
infinite  path  in  the  secondary  network,  no  infinite  vacant  com- 
ponent exists  in  the  Poisson  Boolean  model  B{XpTy  Rij  Xpr) 
driven  by  the  primary  transmitters.  Similarly,  counterexamples 
can  be  easily  constructed  for  choosing  Rpr  =  rj. 

Suppose  there  is  an  infinite  connected  component  in  the  sec- 
ondary network.  Then  we  can  find  a  sequence  of  secondary 
users  {iSi,  ^'2,  aS's,  . . .}  such  that  they  comprise  an  infinite  path 
starting  from  Si  (see  Fig.  14). 

Assume  that  Si  and  S'i+i  (^^  >  1)  are  two  adjacent  secondary 
nodes  in  the  above  infinite  path.  Notice  that  the  distance  fi^,  ^+l 
between  Si  and  S'^+i  satisfies  di^  ^+1  <  Vp  <  rj,  where  the 
second  inequality  <  rj  follows  from  the  fact  that  the  min- 
imum transmission  power  for  successful  reception  is  in  general 
higher  than  the  maximum  allowable  interference  power. 

As  we  know,  all  the  primary  receivers  must  be  outside  the 
two  circles  with  radii  rj  centered  at  Si  and  S'^+i,  respectively, 
as  shown  in  Fig.  15.  Given  e  >  0,  consider  the  rectangle 
— i^i+i^  ^^^^  X  [— e,  e]  between  Si  and  Si^i.  By  a 
simple  computation  in  geometry,  we  have  that  the  minimum 
distance  from  all  the  primary  receivers  to  the  rectangle  is 

\l t'j  —  ^  —  e.  As  illustrated  in  Fig.  14,  it  implies  that  there 
exists  an  infinite  vacant  component  in  the  Poisson  Boolean 

model  B  ^Xpp,  ^Jt'j  —  ^  —  e,  Apt^  driven  by  the  primary 

receivers  10.  By  recalling  the  known  results  in  Section  V-A-II, 
we  thus  conclude  that  for  all  e  >  0 

APT<  (2^r|-r|/4-e)~'Ae(l). 

^•^This  technique  used  here  can  also  be  applied  to  the  case  when  >  rj, 
where  only  the  minimum  distance  from  all  the  primary  receiver  to  the  bar  be- 
tween Si  and  5^+1  needs  to  be  recomputed. 
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Fig.  15.  One  edge  {Si,  Si+i)  in  the  infinite  path. 


Let  e  — >  0,  then  it  yields 


< 


4r| 


rA.(l). 


The  other  term  ^j^2_^2  Ac(l)  in  the  upper  bound  is  obtained 
by  applying  the  same  argument  to  the  Poisson  Boolean  model 
driven  by  the  primary  transmitters. 

D.  Proof  of  Theorem  3 

Consider  the  connected  component  Ca  containing 
an  arbitrarily  chosen  secondary  user  A.  Assuming  that 
\Ca\  >  1,  we  construct  a  branching  process  as  follows. 
Notice  that  if  \Ca\  >  1,  then  A  must  see  an  opportunity,  i.e., 
I(^,r/,rx)  n  l(A,i^7,tx)  is  true.  Call  A  the  initial  point  (or 
0-th  generation)  of  the  branching  process.  Then  the  children 
of  A  (i.e.,  the  first  generation  of  the  branching  process)  are 
secondary  users  which  satisfy  the  following  two  conditions: 

i)  it  is  within  distance  Vp  of  A,  where  Vp  is  the  transmission 
range  of  secondary  users; 

ii)  there  exist  neither  any  primary  receiver  within  distance  rj 
of  the  secondary  user  nor  any  primary  transmitter  within 
distance  Rj  of  the  secondary  user. 

The  n-th  (n  >  2)  generation  of  the  branching  process  is  ob- 
tained similarly  and  these  children  are  connected  to  their  par- 
ents in  the  (n  —  l)-th  generation  of  the  branching  process  via 
bidirectional  links.  Obviously,  all  the  secondary  users  in  Ca  are 
counted  in  the  constructed  branching  process  model.  But  some 
of  them  may  probably  be  counted  more  than  once,  since  we  do 
not  exclude  the  previous  n  generations  (including  generation 
0)  when  we  consider  the  n-th  generation.  Thus,  this  branching 
process  gives  us  an  upper  bound  on  the  number  of  secondary 
users  in  Ca-  It  follows  that  if  the  branching  process  does  not 
grow  to  infinity  wpp.,  then  there  does  not  exist  an  infinite  con- 
nected component  a.s.  in  G{)^s^  ^pt)^  due  to  the  stationarity 
of  the  heterogeneous  network  model.  Since  the  conditional  av- 
erage degree  ^  is  the  average  number  of  offspring  for  every  gen- 
eration, the  necessary  condition  follows  immediately  from  the 
classic  theorem  for  branching  processes [1,  Theorem  2.1.1]. 

^^The  lower  bound  on  the  conditional  average  degree  can  be  improved  by 
constructing  a  multi-type  branching  process.  Specifically,  when  we  consider  the 
children  of  a  secondary  user,  we  categorize  them  into  different  types  based  on 
their  distances  from  the  secondary  user.  A  similar  construction  has  been  used 
to  obtain  the  lower  bound  on  the  critical  density  of  Poisson  Boolean  models  [1, 
Sec.  3.9].  This  multi-type  branching  process  is,  however,  difficult  to  analyze  and 
requires  substantial  further  investigation. 


E.  Proof  of  Theorem  4 

From  the  construction  of  the  dependent  site-percolation 
model  £,  we  know  that  the  existence  of  an  infinite  occupied 
component  in  C  implies  the  existence  of  an  infinite  connected 
component  in  ^(A^,  Apt)  -  Then  in  order  to  obtain  a  sufficient 
condition  for  the  connectivity  of  the  secondary  network,  it 
suffices  to  find  a  sufficient  condition  for  the  existence  of  an 
infinite  occupied  component  in  jC. 

Let  p  be  the  probability  that  one  site  is  occupied.  Then  based 
on  the  definition  of  the  upper  critical  probability  pc  of  jC,  we 
have  that  if  p  >  pc,  Sin  infinite  occupied  component  containing 
the  origin  exists  in  C  wpp.  It  implies  that  if  p  >  pc,  there  exists 
an  infinite  connected  component  in  the  secondary  network  wpp. 
Since  the  event  that  there  exists  an  infinite  connected  component 
in  the  secondary  network  is  invariant  under  the  group  of  shift 
transformations,  it  follows  from  the  ergodicity  of  the  heteroge- 
neous network  model  (see  Lemma  1)  that  ifp  >  Pc,  there  exists 
an  infinite  connected  component  in  the  secondary  network  a.s. 

Let  Fk{Bo)  denote  the  event  that  there  exist  exactly  k  sec- 
ondary users  in  the  associated  dashed  square  Bq  of  a  site  O  and 
let  Gi{Bo)  {f  ^  i  ^  k)  denote  the  event  that  the  secondary 
user  Ai  sees  an  opportunity.  Then  based  on  the  definition  of  oc- 
cupied site  in     we  obtain  a  lower  bound  on  p  as  follows: 

CO 

p=  ^Pr{Ffc(i?o)}Pr{utiG,;(Bo)} 

k=l 

OO 

>  ^Pr{Ffc(Bo)}Pr{Gi(Bo)} 

k=i 

oo 

=  Pr{Gi(Bo)}^Pr{Ffc(Bo)} 

k=i 

=  [1  -  exp{-Xsdl)]FT  |l(Ai,r/,rx)  n  1  (^i,  i^/,  tx)} 


1  —  exp  — 


exp{-XpT7r[Rj+rj-I{Ri,Rp,ri)]}. 


In  the  last  step,  Pr  |l  (v4i,  r/,  rx)  fl  I  (^i,  i?/,  tx)  j  has  been 
obtained  by  setting  the  distance  d  =  0  in  the  expression  for  the 
probability  of  an  unidirectional  opportunity  between  two  sec- 
ondary users  with  distance  d  apart  given  in  [16,  Proposition  1]. 

VI.  Conclusion  and  Future  Directions 

We  have  studied  the  connectivity  of  a  large-scale  ad  hoc  het- 
erogeneous wireless  network  in  terms  of  the  occurrence  of  the 
percolation  phenomenon.  We  have  introduced  the  concept  of 
connectivity  region  to  specify  the  dependency  of  connectivity 
on  the  density  of  the  secondary  users  and  the  traffic  load  of 
the  primary  users.  We  have  shown  several  basic  properties  of 
the  connectivity  region:  the  contiguity  of  the  region,  the  mono- 
tonicity  of  the  boundary  and  the  uniqueness  of  the  infinite  con- 
nected component.  We  have  analytically  characterized  the  crit- 
ical density  of  the  secondary  users  and  the  critical  density  of 
the  primary  transmitters;  they  jointly  specify  the  profile  of  the 
connectivity  region.  We  have  also  established  a  necessary  and  a 
sufficient  condition  for  connectivity,  which  give  an  outer  and  an 
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Fig.  16.  Illustration  of  Sj{t,ri,r2)  (the  common  area  of  two  circles  with  radii 
ri  and  r2  and  centers  t  apart),  Su2  (t,  ,  (the  union  area  of  two  circles  with 
radii  ri  and  r2  and  cents  t  apart)  and  5/2 0,  Rp,t,  rj)  (the  intersection  area 
between  one  circle  with  radius  Rp  and  the  union  of  the  two  identical  circles 
with  radii  rj). 


inner  bound,  respectively,  on  the  connectivity  region.  Further- 
more, by  examining  the  impacts  of  the  secondary  users'  trans- 
mission power  on  the  connectivity  region  and  on  the  conditional 
average  degree  of  a  secondary  user,  we  have  demonstrated  the 
tradeoff  between  proximity  and  spectrum  opportunity.  In  es- 
tablishing these  results,  we  have  used  techniques  and  theories 
in  continuum  percolation,  including  the  coupling  argument,  er- 
godic  theory,  the  discretization  technique,  and  an  approximation 
using  a  branching  process. 

To  highlight  the  interaction  between  the  primary  users  and  the 
secondary  users  in  a  heterogeneous  network,  we  have  ignored 
the  fading  effect  and  the  mutual  interference  between  secondary 
users.  If  we  take  into  account  these  factors,  then  the  received 
signal  to  interference-plus-noise  ratios  at  two  secondary  users 
will  replace  the  distance  between  them  in  condition  (CI)  for  the 
existence  of  a  communication  link  between  them.  This  will  re- 
sult in  a  random  connection  model  (RCM)  with  correlated  links, 
where  the  correlation  between  links  is  due  to  the  mutual  inter- 
ference and  condition  (C2)  on  the  presence  of  the  bidirectional 
opportunity.  There  will  be  a  tradeoff  between  proximity  and  mu- 
tual interference  in  addition  to  the  tradeoff  between  proximity 
and  opportunity  and  the  combination  of  these  two  tradeoffs  will 
complicate  the  characterization  of  the  connectivity  of  the  sec- 
ondary network.  But  since  the  RCM  shares  several  basic  prop- 
erties (e.g.,  ergodicity  and  existence  of  the  critical  density)  with 
the  Boolean  model  used  in  this  paper,  we  expect  that  most  of  the 
results  established  here  (e.g.,  connectivity  region,  critical  den- 
sities) can  be  extended  to  the  RCM. 


Appendix  A 
Expression  for  Conditional  Average  Degree 

An  expression  for  the  conditional  average  degree  /Lt  of  a  sec- 
ondary user  is  presented  in  the  following  proposition. 


Proposition  1:  Let  Xs  and  A pt  be  the  density  of  secondary 
users  and  primary  transmitters.  Let  rj  and  Rj  be  the  interfer- 
ence range  of  the  secondary  and  primary  users  and  and  Rp 
the  transmission  range  of  the  secondary  and  primary  users.  Let 
g{XpTy  rp^ri^  Rp^  Ri)  denote  the  probability  of  a  bidirectional 
opportunity  between  A  and  B  conditioned  on  the  event  that  A 
sees  an  opportunity.  Then  the  conditional  average  degree  of  a 
secondary  user  is  given  by 

/x=  (A^Trr^)  '  g(XpT,rp,ri,Rp,Ri) 


r^2t  I 


^{r]^R]  +  I{Ri,Rp,ri)) 


Si{t,ri,ri)-Si{t,Ri,Ri) 
Si2{r,6,Rp,t,ri) 


II 


Su2it,Ri  ,Ri) 


rdrdO 


(Al) 


where 


I{Ri,Rp,ri) 


pRi 

=  2 

Jo 


ttRI 


Sj{t^  ri,  the  common  area  of  two  circles  with  radii  ri  and 
r2  and  centers  t  apart  [see  Fig.  16(a)]  and  Su2{tj  '^2)  is  the 
union  of  two  circles  with  radii  ri  and  r2  and  centers  t  apart  [see 
Fig.  16(b)].  Si2{r^  0^  Rp^t^  rj)  is  the  intersection  area  between 
one  circle  with  radius  Rp  and  the  union  of  the  two  circles  with 
both  radii  r/  [see  Fig.  16(c)].  For  Sj2{r^0^  Rp^t^rj),  the  two 
identical  circles  are  centered  t  apart  and  the  other  circle  is  cen- 
tered at  (r,  ^),  where  the  middle  point  of  the  centers  of  the  two 
identical  circles  is  chosen  to  be  the  origin  O. 

Expressions  for  I(Ri^Rp^ri)  and  ri,r2)  can  be  ob- 
tained in  explicit  form,  which  can  be  found  in  [1,  Appendix  A]. 
The  expression  for  Si2{r^  6^  Rp-,t-,  tj)  depends  on  the  expres- 
sion for  the  common  area  of  three  circles  which  is  tedious  and 
is  given  in  [26] .  By  applying  the  basic  property  of  the  exponen- 
tial function  to  (Al),  we  can  easily  show  that  ^(•)  is  a  strictly 
decreasing  function  of  Apt- 

Proof:  Let  Fj^{A)  denote  the  event  that  there  exist  exactly  k 
neighbors  of  a  secondary  user  A.  We  thus  have  the  first  equation 
shown  at  the  bottom  of  the  next  page. 

When  =  0,  it  is  obvious  that  deg(A)  =  0.  When  A;  >  0,  let 
Bi  be  a  neighbor  of  A  and  Ipi  an  indicator  function  for  Bi  such 
that  1^^  =  1  if  1(5^,  r/,  rx)  D  i^/,  tx)  occurs  and  Ipi  = 
0  otherwise.  Then  by  considering  the  statistical  independence 
and  equivalence  of  the  k  secondary  users,  we  have  the  second 
equation  shown  at  the  bottom  of  the  next  page.  It  follows  that 
we  get  (A2),  as  shown  at  the  bottom  of  the  next  page. 

According  to  the  definition  of  spectrum  opportunity, 
Pr{l(74,  r/,  rx)  D  1{A^  Rj^tx)}  can  be  obtained  by  setting 
the  distance  d  =  0  in  the  expression  for  the  probability  of  a 
unidirectional  opportunity  between  two  secondary  users  with 
distance  d  apart  given  in  Proposition  1  in  [16]. 


Pr{l(^,r7,rx)  fl  l(^,i?/,tx)} 
=  exp[-ApT7r(r|  - 


Rj-I(Ri,Rp,rj))]  (A3) 
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Next,  we  derive  the  expression  for  the  probabiHty 
of  a  bidirectional  opportunity,  i.e.,  Pr{l(5i, r/, rx)  H 
l(5i,i?/,tx)nD(v4,r/,rx)  fl  n(v4,  i?/,  tx)},  which  depends 
on  the  location  of  Bi  only  through  its  distance  to  A.  Since 
is  uniformly  distributed  within  distance  of  A,  the  density 
function  of  the  distance  t  between  Bi  and  A  is  given  by  ^ 
for  0  <  t  <  VpAn  this  case,  the  probability  of  a  bidirectional 
opportunity  can  be  written  as  shown  in  (A4),  also  shown  at 
the  bottom  of  the  page,  where  the  integrand  can  be  written  as 
shown  in  (A5),  at  the  bottom  of  the  page. 


Next,  we  compute  the  two  probabilities  in  (A5)  one  by  one. 
Since  the  primary  receivers  admit  a  Poisson  point  process  with 
density  Apt,  we  have 


Pr{l(Bi,r/,rx)  H  D(A^/,rx)|  d{Bi,A)  =  t} 

=  exp[-ApT(2^r|  -  Si{t,  n,  n))]  (A6) 

where  rj^rj)  is  the  common  area  of  two  circles  with  both 
radii  rj  and  centers  t  apart  [see  Fig.  16(a)]. 


li=E[deg{A)  \  D(A,r/,rx)  fl  n(^,i^/,tx)] 
=  Ek[E[deg{A)  \  D(Ar/,rx)  n  l(Ai^/,tx)  n  Fk{A)]] 

=  ^  e-^s^-l^^^^E[deg{A)  \  l(Ar7,rx)  n  l{A,Ri,tx)  fl  Fk{A)] 

k=0 


E[deg{A)  \       r/,  rx)  n  D(A,  Rj,  tx)  n  Fk{A)] 

k 

=  ^E[lp,  I  D(A,r/,rx)nl(Ai^/,tx)] 
i=i 

=  A;E[lpi  I  l(^,r/,rx)  fl  D(A,i^7,tx)] 

=  A;Pr{l(Bi,r7,rx)  H  l(Ei,i?/,tx)  |  l(.4,r/,rx)  H  D(A,i^/,tx)} 
_  ^Pr{l(Bi,r/,rx)  H  l(Ei,i^7,tx)  n  l(^,r/,rx)  fl  D(A,i?/,tx)} 
Pr{l(^,r/,rx)  n  D(^,i^/,tx)} 


_2Pr{D(Bi,r/,rx)  H  D(Bi,i?/,tx)  H  D(A,r/,rx)  n  l(^,i^/,tx)} 
^  Pr{D(A,r/,rx)  n  l(A,i^/,tx)} 


Pr{D(Bi,r/,rx)  n  l(Bi,i^/,tx)  fl  D(A,r/,rx)  n  l(^,i^/,tx)} 

rp  2t         

=  /     — Pr{l(Bi,r/,rx)  H  l(Bi,i?/,tx)  H  l(A,r/,rx)  fl  D(A,i?/,tx)|  d{Bi,A)  =  t}dt  (A4) 
Jo 


Pr{n(Bi,r7,rx)  fl  D(Bi,i?7,tx)  fl  n(^,r/,rx)  fl  D(^,i?7,tx)  |  =  t} 

=  Pr{D(Bi,i?/,tx)  n  D(A,i?/,tx)  I  D(Bi,r/,rx)  H  I(v4,r/,rx)  fl  A)  =  t} 

•  Pr{l(Ei,r7,rx)  H  D(yl,r/,rx)  |        ,A)=t}  (A5) 
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Pr{l(Ei,i?7,tx)  n  D(^,i?7,tx)  I  l(Bi,r/,rx)  H  l(^,r/,rx)  H  A)  =  t} 
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=  exp  < 


exp  < 
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(A8) 


Let  XpT  denote  the  Poisson  point  process  formed  by 
primary  transmitters.  If  we  remove  from  Xpp  primary  trans- 
mitters whose  receivers  are  within  distance  rj  of  Bi  or  A, 
then  it  follows  from  the  Coloring  Theorem  [17,  Chapter  5]  that 
all  the  remaining  primary  transmitters  form  another  Poisson 
point  process  with  density  Apt  1  —  ^?i^iri^2^2^i!2l j  ^  where 
*^/2(^5  O-fRp-it^rj)  is  the  area  of  the  circle  with  radius  Rp  and 
centered  at  (r,  6^)  intersecting  the  two  circles  both  with  radii 
rj  and  centers^  apart  [see  Fig.  16(c)].  We  thus  have  (A7),  at 
the  top  of  the  page,  where  Su2{t^  Riy  Ri)  is  the  union  of  two 
circles  with  both  radii  Rj  and  centerst  apart  [see  Fig.  16(b)]. 

Substitute  (A6),  (A7)  into  (A5),  we  have  (A8),  as  shown  at 
the  top  of  the  page.  The  expression  for  the  conditional  average 
degree  /x  thus  follows  by  plugging  (A8)  into  (A4)  and  then  (A3), 
(A4)  into  (A2). 


Appendix  B 
Proof  of  Corollary  3 

From  [1,  Appendix  A]  and  Figs.  16(b)  and  (c),  we  know  that 
when  rj  >  Rp  -\-  Rj 


I{Ri,Rp,ri)=Rj 
Si2{r,0,Rp,t,ri) 


(Bl) 


Su2(t,Ri,Ri) 


^Rl 


rdrdO  =  Su2{t,  Ri,  Ri) 

=  27rRj  -  Si{t,Ri,Ri). 

(B2) 


Substitute  (Bl)  and  (B2)  into  (Al),  we  have 

li^Xs-Krl  /      —  exp[-ApT(7rr|-S'/(t,r/,r/))]dt.  (B3) 
Jo 
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Plugging  the  expression  for  Si{t^ri^ri)  [1,  Appendix  A]  into 
(B3)  yields  the  equation  shown  at  the  top  of  the  previous  page. 

By  applying  the  inequality  arccos(rc)  <  |— :r  forO  <  re  <  1, 
we  have  the  equation  at  the  top  of  the  previous  page,  where  we 
have  assumed  that  Vp  =  /?r/  (0  <  /?  <  1)  under  the  disk  signal 
propagation  and  interference  model.  Since  r/  oc  {ptxY^^^  we 
arrive  at  Corollary  3. 
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Summary  Seizure  prediction  has  proven  to  be  difficult  in  clinically  realistic  environments.  Is  it 
possible  that  fluctuations  in  cortical  firing  could  influence  the  onset  of  seizures  in  an  ictal  zone? 
To  test  this,  we  have  now  used  neural  network  simulations  in  a  computational  model  of  cortex 
having  a  total  of  65,536  neurons  with  intercellular  wiring  patterned  after  histological  data.  A 
spatially  distributed  Poisson  driven  background  input  representing  the  activity  of  neighboring 
cortex  affected  1%  of  the  neurons.  Gamma  distributions  were  fit  to  the  interbursting  phase 
intervals,  a  non-parametric  test  for  randomness  was  applied,  and  a  dynamical  systems  analysis 
was  performed  to  search  for  period- 1  orbits  in  the  intervals.  The  non-parametric  analysis  sug- 
gests that  intervals  are  being  drawn  at  random  from  their  underlying  joint  distribution  and  the 
dynamical  systems  analysis  is  consistent  with  a  nondeterministic  dynamical  interpretation  of 
the  generation  of  bursting  phases.  These  results  imply  that  in  a  region  of  cortex  with  abnormal 
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connectivity  analogous  to  a  seizure  focus,  it  is  possible  to  initiate  seizure  activity  with  fluctu- 
ations of  input  from  the  surrounding  cortical  regions.  These  findings  suggest  one  possibility  for 
ictal  generation  from  abnormal  focal  epileptic  networks.  This  mechanism  additionally  could  help 
explain  the  difficulty  in  predicting  partial  seizures  in  some  patients. 
©  2011  Elsevier  B.V.  All  rights  reserved. 


Introduction 

Epileptic  seizures  are  brief,  episodic  phenomena.  Partial 
seizures,  the  most  common  seizure  type,  arise  from  focal 
brain  regions  (e.g.  temporal,  parietal)  (Niedermeyer,  2005). 

While  in  some  instances  there  may  be  an  identifiable 
cause  for  the  seizures  (e.g.  tumor,  cavernoma,  hippocam- 
pal  sclerosis),  in  other  instances  no  clear  pathology  is 
determined.  The  hallmark  of  an  epileptic  seizure  is  the 
involvement  of  local  or  regional  neural  networks;  repeti- 
tive firing  of  a  single  neuron  does  not  produce  symptoms 
without  this  network  involvement.  What  causes  the  interic- 
tal  to  ictal  transition?  A  typical  partial  seizure  lasts  less  than 
2  min  plus  any  postictal  state  (Afra  et  al.,  2008).  Therefore, 
even  if  a  patient  has  very  frequent  seizures,  the  majority 
of  time  is  spent  in  the  interictal  state.  While  some  seizures 
can  be  provoked  or  are  more  likely  to  occur  under  certain 
situations  (e.g.  sleep  deprivation,  photic  stimulation),  the 
majority  of  seizures  appear  to  occur  spontaneously  without 
known  association  with  definable  influences. 

There  has  been  considerable  interest  in  seizure  predic- 
tion in  recent  years.  Obviously  if  seizures  could  be  reliably 
predicted,  then  the  option  for  targeted  therapy  exists  (e.g. 
stimulation),  or  at  least  the  patient  could  remove  them- 
selves from  potentially  dangerous  situations.  The  underlying 
hypothesis  for  seizure  prediction  is  that  there  are  changes 
in  cerebral  dynamics  that  may  precede  the  clinical  seizure 
by  minutes  to  hours  (reviewed  in  Sackellares,  2008).  These 
changes  may  be  local  (i.e.  near  the  seizure  focus)  or  remote. 
These  changes  are  not  apparent  with  visual  analysis  of  the 
EEG,  even  with  intracranial  recording  arrays.  Some  groups 
have  identified  high  frequency  activity  that  may  signal  the 
onset  of  neocortical  partial  seizures,  but  this  is  an  exam- 
ple of  improved  seizure  detection,  not  prediction  (Worrell 
et  al. ,  2004,  2008;  Bragin  et  al. ,  201 0).  Reliable  seizure  pred- 
ication has  been  challenging  and  even  the  most  enthusiastic 
proponents  of  the  prediction  hypothesis  acknowledge  the 
difficulties  with  current  algorithms  (Lehnertz  et  al.,  2007; 
Mormann  et  al.,  2007;  Andrzejak  et  al.,  2009). 

Seizure  prediction  may  be  difficult  due  to  rapid  bistable 
state  changes  at  the  time  of  ictal  onset  in  the  neocortex 
(Suffczynski  et  al.,  2006;  Lopes  da  Silva  et  al.,  2003). 

The  mechanisms  underlying  a  bistable  state  change 
may  be  quite  different  between  primary  generalized  (e.g. 
absence)  and  partial  epileptic  seizures.  A  bistable  state 
change  may  be  more  applicable  to  these  primary  general- 
ized seizures  which  have  abrupt  bilateral  cerebral  onset. 
In  this  paper,  a  different  possible  mechanism  is  presented 
under  which  seizure  prediction  would  be  difficult  in  some 
patients  with  focal  seizure  onset. 

Knowing,  as  we  do,  that  partial  seizures  are  a  reflec- 
tion of  transient  abnormal  regional  network  activity,  it 
is  reasonable  to  postulate  that  these  seizures  in  at  least 
some  (perhaps  many)  patients  result  from  abnormal  neural 


networks  (e.g.  the  epileptogenic  zone)  (Jacobs  et  al.,  2000). 
We  describe  here  a  model  of  the  epileptogenic  zone  where 
the  epileptic  focus  is  represented  by  an  abnormal  neu- 
ral network  that  has  very  slightly  altered  connectivity  so 
that,  while  seizures  only  occur  infrequently,  they  can  be 
triggered  by  normal  background  activity  originating  from 
outside  the  epileptogenic  zone.  This  background  activity 
could  be  influenced  by  various  physiologic  factors  (e.g. 
sleep),  but  nevertheless  this  background  activity  would  not 
result  in  seizure  activity  in  the  non-epileptic  brain.  This 
does  not  discount  the  possibility  that  some  changes  in  neural 
network  synchrony  may  occur  in  the  ''normal"  brain  since 
the  cumulative  lifetime  incidence  of  unprovoked  seizures 
approaches  4%  (Hauser  et  al.,  1993).  Often  these  seizures 
are  provoked  (e.g.  medications  and  alcohol)  and  less  than 
half  of  these  patients  have  recurrent  seizures.  The  life- 
time cumulative  risk  of  developing  epilepsy  only  ranges 
from  1.4%  to  3.3%  (Krumholz  et  al.,  2007;  Berg  and  Shinnar, 
1991).  In  this  model,  however,  where  normal  background 
activity,  occasionally  or  rarely  produces  a  seizure  in  abnor- 
mal regional  networks,  seizure  prediction  would  be  difficult 
since  detectable  preictal  changes  would  not  be  present;  the 
first  changes  would  in  fact  be  seizure  initiation. 

Epileptic  networks  in  neocortex  or  the  hippocampus 
show  anatomical  changes  compared  to  normal  tissue  (Jacobs 
et  al.,  2000;  Sallin  et  al.,  1995).  These  changes  can  progress 
with  time  (Sallin  et  al.,  1995;  Arellano  et  al.,  2004).  This 
could  result  in  neuronal  networks  more  amenable  to  seizure 
generation  (electrical  or  clinical)  over  large  regional  areas. 
There  is  a  complex  interrelationship,  much  of  it  not  well 
understood,  between  neurons  which  are  dysfunctional  and 
the  neural  networks  which  can  promote  seizures  (Leussis 
and  Heinrichs,  2007;  Kumar  et  al.,  2007;  Swann  et  al., 
2007).  Even  in  the  non-epileptic  brain,  excitatory  connec- 
tions predominate  with  80-90%  of  synapses  being  excitatory 
(Braitenberg  and  Schuz,  1998). 

With  neuronal  network  simulations  it  is  possible  to  con- 
trol, study,  and  quickly  change  the  various  influences  on 
network  behavior.  Recently,  we  presented  the  results  of 
computational  simulation  studies  examining  the  role  of 
external  field  stimulation  on  ongoing  bursting  activity  in  a 
neural  network  (Anderson  et  al.,  2007,  2009).  The  corti- 
cal model  used  in  these  studies  consists  of  discrete  single 
compartment  Hodgkin-Huxley  type  cells  which  are  spa- 
tially arranged  in  a  realistic  fashion  having  both  a  layered 
and  columnar  structure.  Since  neural  network  behavior 
reflects  the  aggregate  output  of  the  component  neurons, 
single  compartment  neurons  allow  greater  computational 
efficiency  and  the  ability  to  model  larger  networks  in  studies 
of  network  behavior.  Arrangements  of  connected  simu- 
lated neurons  in  this  manner  can  demonstrate  spontaneous 
bursting  phases  and  have  spatial  characteristics  similar  to 
seizures  recorded  from  humans  (Anderson  et  al.,  2007,  2009; 
Kudela  et  al.,  1997,  2003a, b,  2005;  Franaszczuk  et  al.. 
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2003).  We  now  present  the  results  of  a  similar  neuronal 
network  model  with  random  surrounding  background  inputs. 
The  goal  of  this  study  was  to  investigate  the  statistical  struc- 
ture of  the  resulting  bursting  network  activity  to  seek  the 
presence  or  absence  of  predictable  patterns  in  the  behavior. 

Materials  and  methods 
Computational  model  format 

The  individual  neurons  in  this  neocortical  model  were  represented 
by  single  compartment  neurons  bearing  synaptic  connections  from 
the  rest  of  the  network,  and  embedded  with  a  fixed  set  of  ionic 
conductances.  The  membrane  potential  varies  as: 

dV 

^^m-^  =  'syn  -  'Na  -  'ca  -  'k  -  /K(Ca)  -  'a  -  ^L- 

The  individual  currents  include  the  input  synaptic  current  /syn, 
inward  sodium  and  calcium  currents  /na  and  /ca,  outward  potas- 
sium currents  including  a  delayed  rectifier  current  /k,  a  calcium 
dependent  potassium  current  /K(ca)>  a  transient  potassium  current 
/a,  and  a  leakage  current  l\_  (Av-Ron,  1994).  The  minicolumns  used 
in  the  simulation  consist  of  16  cells  with  intrinsic  intracolumnar 
wiring,  adapted  from  the  neocortical  work  of  Douglas  and  Martin 
(2004),  as  described  more  fully  in  previous  studies  (Anderson  et  al., 
2007,  2009).  This  is  both  for  its  ease  of  implementation  compu- 
tationally and  for  its  experimental  support  in  somatosensory  and 
visual  cortex  (Douglas  and  Martin,  2004).  The  geometry  imposed  on 
a  computational  model  becomes  relevant  when  studying  any  spa- 
tially dependent  effects  on  the  resultant  spreading  activity.  The 
minicolumns  in  this  simulation  have  a  25  ijum  center-to-center  spac- 
ing in  a  square  lattice  repeating  structure.  The  total  number  of 
cells  examined  was  65,536,  representing  a  simulated  cortical  sur- 
face area  of  1.6  mm  x  1.6  mm.  Fig.  1  demonstrates  a  schematic  of 
the  intracolumnar  excitatory  cell  connections  and  the  organization 
of  the  minicolumns  in  planar  space  as  well  as  snapshots  of  the 
resultant  activity  in  the  layer  ll/lll  pyramidal  cell  component  dur- 
ing model  bursting  activity.  The  model  connectivity  and  synaptic 
currents  are  described  further  in  the  Supplementary  material. 

The  base  connection  pattern  studied  in  this  report  is  represen- 
tative of  one  that  can  produce  robust  bursting  as  previously  studied 
(Anderson  et  al.,  2007).  The  numbers  of  extra-columnar  connec- 
tions formed  by  each  cell  class  are  presented  in  Supplementary 
Table  1 .  There  are  seven  cell  classes  modeled:  four  classes  of  exci- 
tatory cells  including  layer  ll/lll  pyramidal  cells,  layer  IV  stellate 
cells,  layer  V  pyramidal  cells,  and  layer  VI  pyramidal  cells,  and 
three  classes  of  interneurons  including  basket  cells,  double  bouquet 
cells,  and  chandelier  cells.  Most  of  the  model  changes  described 
in  the  described  studies  involve  alterations  in  connection  num- 
bers between  layer  ll/lll  pyramidal  cells,  one  of  the  known  robust 
horizontal  connections  systems  in  the  cortex  supporting  epileptic 
propagation  (Telfeian  and  Connors,  1998).  The  base  connection  for 
this  system,  ^2/3:2/3  =  178,  is  defined  as  the  number  of  layer  ll/lll 
pyramidal  cells  a  given  layer  ll/lll  pyramidal  cell  contacts  in  its 
axonal  distribution. 

The  model  in  general  illustrates  consistent  bursting  behavior, 
with  epochs  of  spontaneous  bursting  onset  and  cessation  given  a 
random  background  input  of  Poisson  based  charge  injection  to  1% 
of  the  cells  in  the  model.  This  is  an  effort  to  treat  the  underlying 
cortical  activity  as  input  from  neighboring  cortex,  with  the  model 
itself  treated  as  the  epileptic  focus  given  its  ability  to  produce  net- 
work bursting  epochs.  The  synaptic  input  used  for  the  background 
was  not  periodic  in  nature.  Average  rates  for  these  Poisson  distri- 
butions are  described  in  "Activity  changes  with  mean  background 
frequency"  section  and  Fig.  2.  The  synaptic  activations  used  for  the 


background  inputs  were  the  same  used  in  the  cell  to  cell  connec- 
tions, and  followed  the  same  rise  and  decay  times  appropriate  for 
postsynaptic  potentials. 

The  pseudo-random  number  generator  used  for  the  application 
of  the  noise  pulses  was  a  linear  congruential  generator  implemented 
with  the  C-function  drand48,  with  an  intrinsic  period  of  281  x  10^^. 
For  a  30  s  simulation  and  10~^  s  time-step,  this  function  was  called 
1.966  X  10^  times  for  1%  of  the  cells  undergoing  background  input. 
The  period  length  for  the  pseudo-random  generator  is  143,000  times 
larger  than  this  number. 

Statistics  and  analysis 

The  interbursting  phase  intervals  in  the  model  were  fit  with  a 
gamma  distribution  (Suffczynski  et  al.,  2005,  2006).  The  functional 
form  of  this  distribution  /(...)  is  given  by 


/(At)  =  {^"r{a)r^  At"-''  exp 


where  Ar  is  the  interbursting  phase  interval,  a  is  the  shape 
parameter,  ^  is  the  scale  parameter,  and  r(...)  represents  the 
gamma  function.  Parameters  were  estimated  using  the  AAATLAB 
function  gamfit  which  returns  maximum  likelihood  estimates  and 
95%  confidence  intervals  for  the  shape  and  scale  parameters.  A  non- 
parametric  test  of  randomness  was  used  to  attempt  to  establish 
whether  intervals  were  being  drawn  at  random  from  their  underly- 
ing joint  distribution.  This  was  based  on  the  circular  definition  of 
the  lag-1  serial  correlation  coefficient  (Wald  and  Wolfowitz,  1943). 
p-Values  were  computed  under  the  assumption  of  asymptotic  nor- 
mality of  the  test  statistic. 

A  method  for  the  detection  of  unstable  periodic  orbits  (of  period- 
1 )  in  successive  interbursting  phase  intervals  was  applied  (So  et  al., 
1996,  1997).  A  period-1  orbit  is  a  fixed  point  of  the  nonlinear  map 
expressing  the  evolution  of  the  state  of  a  system,  iterated  a  single 
time  (Guckenheimer  and  Holmes,  1983).  Intervals  were  embedded 
in  a  two  dimensional  state  space  and  10"^  sets  of  transformed  inter- 
vals were  obtained  after  randomization.  Dimensional  reduction  was 
instituted  using  circles  of  radius  9.4  ms  centered  along  the  diago- 
nal of  the  state  space  (Le  Van  Quyen  et  al.,  1997).  One  hundred 
surrogates  were  produced  to  test  the  significance  of  peaks  which 
appeared  along  this  diagonal.  The  surrogates  were  generated  using 
the  amplitude  adjusted  Fourier  transform  algorithm  (Theiler  et  al., 
1992).  This  shuffles  the  original  sequence  of  interbursting  phase 
intervals,  maintaining  the  original  amplitude  distribution  of  the 
data  while  approximately  matching  its  Fourier  power  spectrum. 

Results 

A  total  of  1600  s  of  discontinuous  20-  and  30-s  data  seg- 
ments were  obtained,  holding  the  base  connectivity  of  the 
layer  ll/lll  pyramidal  cells  to  layer  ll/lll  pyramidal  cells  at 
^2/3:2/3  =  178.  Only  the  random  number  seed  supplied  to  the 
background  input  generator  was  varied  for  each  of  these 
runs.  Additionally,  five  continuous  segments  of  data  were 
obtained  with  the  base  connectivity  set  at  1^213.113  =  Ml  of 
lengths  320  s,  250  s  and  N2/3:2/3  =  178  of  lengths  195  s,  140  s, 
and  208  s.  These  data  were  used  for  the  dynamical  sys- 
tems analysis  presented  below.  In  addition  to  these  data, 
sixteen  20-s  runs  were  obtained  with  the  model  while  vary- 
ing the  mean  background  input  frequency  at  the  base  level 
of  connectivity.  Five  20-s  runs  were  obtained  at  the  base 
connectivity  while  varying  the  temporal  pattern  of  the  back- 
ground input,  and  ten  20-s  runs  were  obtained  with  a  fixed 
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Figure  1  (A)  Representative  connectivity  of  the  excitatory  cellular  component  in  a  given  modeled  minicolumn,  wiring  after 
(Douglas  and  Martin,  2004).  (B)  Three  dimensional  arrangement  of  the  16x16  array  of  minicolumns  in  space.  (C)  Representative 
snapshots  of  evolving  activity  over  0.02s  in  the  layer  ll/lll  pyramidal  cell  component.  Each  pixel  represents  one  cell,  color  coded 
proportionally  to  the  number  of  action  potentials  fired  in  bins  of  1  /  1 00  of  a  second. 
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Figure  2  Network  activity  produced  by  sequential  increases  in  the  mean  frequency  of  the  applied  background  activity  (background 
synaptic  input  provided  to  a  fixed  1%  set  of  the  modeled  cells,  summed  layer  ll/lll  pyramidal  cell  action  potentials  in  10  ms  bins). 
The  model  exhibits  a  transition  from  episodic  bursting  to  a  very  regular  bursting  behavior  driven  by  the  background  input. 


sequence  of  background  input  while  varying  the  overall  layer 
ll/lll  to  layer  ll/lll  connectivity  (Nm-.m)- 


Activity  changes  with  mean  background  frequency 

These  experiments  were  performed  with  the  base  connec- 
tivity of  the  layer  ll/lll  pyramidal  cell  system  set  at  the 
base  value  of  Nz/siz/s  =  178.  If  the  mean  frequency  of  the 
applied  background  synaptic  input  is  varied  from  0.25/s 
up  to  10/s,  several  patterns  of  activity  become  apparent 
(Fig.  2).  At  0.25/s,  only  the  low  level  set  of  activity  pro- 
duced with  action  potential  production  by  the  background 
input  is  observed,  at  the  cells  where  the  input  takes  place. 
This  plot  essentially  demonstrates  the  Poisson-based  random 
network  activity  between  the  bursting  phases.  At  0.5/s,  spo- 
radic bursting  activity  transmitted  to  the  network  as  a  whole 
can  be  observed,  with  long  quiescent  epochs.  As  the  applied 
mean  background  frequency  is  further  increased,  longer  and 
longer  periods  of  constant  bursting  activity  can  be  observed 
up  to  5/s.  After  this,  a  second  activity  transition  is  observed, 
in  which  the  activity  changes  from  continuous  bursting  into 
short  periods  of  very  large  amplitude  bursting  (in  terms  of 
numbers  of  active  neurons)  punctuated  by  brief  periods  of 
quiescence.  At  10/s  it  appears  to  dominate  the  activity.  This 
implies  a  saturation  mechanism  in  this  class  of  connected 
network,  which  comes  into  play  after  a  critical  percentage 
of  cells  are  excited  per  time  step.  This  saturation  behav- 
ior is  again  an  intrinsic  property  of  the  fixed  network  being 
probed.  Additionally,  these  studies  imply  that  seizure  onset 
can  be  driven  by  neighboring  cortical  activity,  albeit  regu- 
lar patterns  that  might  not  be  typical  of  random  background 
input  activity  utilized  here. 


Netvy^ork  activity  altered  with  input  pattern 
changes 

Within  the  context  of  this  model,  it  is  possible  to  change  the 
random  pattern  of  connectivity  between  represented  cells, 
and  still  keep  the  total  number  of  connections  between  the 
various  cell  classes  constant.  By  varying  the  random  number 
seed  supplied  to  the  generator  distributing  the  connections, 
different  patterns  of  activity  can  be  demonstrated,  even 
with  the  same  application  of  underlying  cortical  activity 
applied  to  the  same  cells.  Examples  of  the  changes  in  activ- 
ity are  presented  in  Fig.  3A.  The  pattern  produced  ranges 
from  almost  constant  bursting  throughout  the  20  s  exam- 
ined, to  brief  periods  of  on  and  off  bursting.  Similarly,  the 
connectional  pattern  can  be  held  constant  along  with  the 
cells  in  which  the  background  activity  is  applied,  while  vary- 
ing the  random  number  seed  responsible  for  producing  the 
order  in  which  background  pulses  are  injected  into  the  cells. 
This  produces  similar  alterations  in  network  activity  demon- 
strated in  Fig.  3B,  and  can  include  several  time  scales  of 
bursting  epochs.  These  studies  imply  a  rich  dynamics  of 
stochastic  behavior  in  randomly  connected  neural  networks 
receiving  temporally  uncorrelated  background  input,  and 
again  point  toward  difficulties  in  predicting  when  the  burst- 
ing phases  might  begin. 

Network  activity  is  very  sensitive  to  numbers  of 
excitatory  connections 

Finally,  changing  the  numbers  of  connections  in  this  net- 
work model  can  produce  substantial  alterations  in  network 
behavior.  Fig.  4  presents  a  sequence  of  plots  of  the  layer 
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Figure  3  (A)  Network  activity  induced  by  varying  the  random  connectivity  pattern  between  cell  classes  (different  connectivity 
seeds,  N^°^^,  for  the  random  number  generator).  Numbers  of  action  potentials  in  layer  II /I II  pyramidal  cell  component,  10  ms  time 
bins.  (B)  Network  activity  induced  by  varying  the  random  time  sequence  of  background  synaptic  input,  N^^^.  In  these  experiments, 
all  cellular  connections  remain  fixed,  and  the  identity  of  the  cells  undergoing  background  synaptic  input  remain  fixed. 


Il/lll  pyramidal  cell  activity  (time-binned  action  potential 
numbers)  for  various  degrees  of  extracolumnar  connectivity 
between  the  layer  Il/lll  pyramidal  cells.  The  base  activ- 
ity explored  in  this  manuscript  is  shown  in  the  plot  with 
total  number  of  connections  held  at  Nz/siz/s  =  178  (between 
extra-minicolumnar  layer  Il/lll  pyramidal  cells).  A  rapid 
reduction  in  network  bursting  is  shown  for  a  connection  num- 
ber reduced  below  this,  and  almost  continuous  activity  is 
shown  for  connection  numbers  above  this.  Fig.  3A  and  B  data 
were  obtained  with  the  connectivity  set  at  Nz/siz/s  =  178. 
Only  the  random  pattern  of  connectivity  is  varied  in  Fig.  3A 
and  the  time  sequence  of  background  input  in  Fig.  3B.  These 
studies  were  performed  in  the  context  of  a  constant  aver- 
age level  of  surrounding  background  input,  and  imply  the 
importance  of  internal  connectivity  in  the  development  of 
uncontrolled  bursting  of  the  network. 


Statistical  analysis  of  interburst  phase  intervals 

The  statistical  properties  of  the  interburst  phase  intervals 
for  five  continuous  runs  of  the  model  at  connectivities  of 
^2/3:2/3  =  178  and  172  were  analyzed.  This  was  motivated 
from  a  dynamical  systems  perspective,  where  periodicity  in 
sequential  intervals  was  sought  for.  Fig.  5A  displays  a  his- 
togram approximation  to  the  probability  density  function 
for  interbursting  phase  intervals  for  Continuous  Run  1 ,  which 
consisted  of  163  intervals  collected  from  a  320  s  run  of  the 
simulation.  Fig.  5D  displays  the  same  histogram  approxi- 
mation for  Continuous  Run  4,  with  a  total  of  58  intervals 
collected  from  a  140  s  run  of  the  simulation.  Gamma  distri- 
butions were  used  to  fit  these  densities  (Fig.  5A  and  D  (blue 
traces))  (see  ''Materials  and  methods"  section;  Suffczynski 
et  al.,  2005,  2006).  In  the  case  of  Continuous  Run  1 
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Figure  4  Changing  the  absolute  connectivity  in  the  layer  ll/lll  pyramidal  cell  component  (number  of  layer  ll/lll  pyramidal  cells 
contacted  by  a  given  layer  ll/lll  pyramidal  cell,  N2/3:2/3)  in  the  model  produces  alterations  in  the  network  bursting  behavior.  At 
very  low  absolute  connectivity  (N2/3:2/3  =  110)  network  bursting  is  brief  and  isolated,  while  at  higher  levels  of  absolute  connectivity 
periods  of  constant  bursting  can  be  observed. 


(Fig.  5A)  we  found  ol  =  6.09  (4.61  -8.04)  for  the  shape  param- 
eter of  the  distribution,  and  ^  =  0.26  (0.20-0.35)  for  the 
scale  parameter  (95%  confidence  intervals  in  parentheses). 
For  Continuous  Run  4  (Fig.  5D),  the  corresponding  values 
were  Qf  =  7.14  (5.00-10.19),  and  ^  =  0.23  (0.16-0.33)  (simi- 
lar results  were  obtained  for  3  further  continuous  runs  and 
one  discontinuous  run  -  results  not  shown).  In  accord  with 
the  interpretation  of  Suffczynski  et  al.  (2005,  2006),  the  fact 
that  the  shape  parameter  a  was  larger  than  one  in  all  runs, 
suggests  the  potential  presence  of  periodicity  in  the  genera- 
tion of  bursting  epochs.  To  probe  this  link  further,  additional 
statistical  tests  were  performed  as  described  below. 

To  ascertain  how  intervals  were  being  drawn  from  their 
underlying  joint  distribution,  we  applied  a  non-parametric 
test  of  randomness  to  the  interbursting  phase  intervals  (Wald 
and  Wolfowitz,  1943).  In  the  case  of  Continuous  Run  1,  we 
found  that  one  cannot  reject  the  null  hypothesis  of  ran- 
domness at  the  5%  significance  level  (p-value  0.93).  For 
Continuous  Run  4,  we  found  the  same  conclusion  at  the  5% 
significance  level  (p-value  0.29).  This  conclusion  was  also 
borne  out  for  the  remaining  three  continuous  runs. 

A  method  for  the  detection  of  unstable  periodic  orbits 
(of  period-1)  was  then  applied  to  test  for  the  presence 
of  deterministic  dynamics  in  the  generation  of  interburst- 
ing phase  intervals  (So  et  al.,  1996,  1997).  This  method 
institutes  a  transformation  of  the  sequence  of  intervals  such 


that  the  transformed  sequence  is  clustered  around  locations 
of  potential  periodic  orbits.  One  can  compare  the  peaks  of 
these  clusters  to  those  generated  by  surrogate  data  (Theiler 
et  al.,  1992),  to  compute  the  statistical  significance  of  the 
peaks,  and  thereby  ascertain  the  potential  existence  of  peri- 
odic orbits  in  the  data.  Fig.  5C  and  F  shows  the  peaks  of  the 
clusters  were  not  significantly  greater  than  those  generated 
by  surrogate  data  (see  caption),  and  so  no  period-1  orbits 
were  detected  for  either  run  (nor  for  the  remaining  three 
continuous  runs),  at  the  limit  of  detection  in  the  current 
data  set. 


Discussion 

Our  results  demonstrate  that  while  holding  the  mean  prop- 
erties of  the  network  stable  (mean  connectivity  numbers, 
mean  background  excitation  rates),  very  rich  and  strikingly 
different  dynamics  are  produced  by  changing  the  model 
details.  Epileptogenic  behavior  can  be  created  in  these  net- 
works, as  described  above,  by  changes  in  the  random  pattern 
of  connectivity,  while  holding  fixed  the  intrinsic  active  or 
passive  membrane  properties  in  the  constituent  neurons. 
Such  changes  in  connectivity  could  be  analogous  to  changes 
in  underlying  connectivity  that  might  occur  following 
cerebral  insults,  or  repetitive  seizures.  Similar  modeling 
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Figure  5  Statistical  analysis  of  the  interbursting  phase  intervals.  (A  and  D)  Interval  histograms  are  fit  with  gamma  distributions 
revealing  shape  parameters  a>^  in  the  case  of  Continuous  Run  1  (A)  and  Continuous  Run  4  (D).  (B  and  E)  Two  dimensional  delay 
embedding  of  sequential  intervals  (Spearman  correlation  coefficients  are  not  significantly  different  from  zero)  for  Continuous  Runs 
1  and  4,  respectively.  (C  and  F)  Testing  the  significance  of  potential  period- 1  orbits  detected  through  a  dynamical  analysis  of  the 
sequence  of  points  displayed  in  (B)  and  (E),  respectively.  The  y-axis  represents  the  fraction  of  surrogate  (shuffled,  see  "Materials 
and  methods"  section)  sequences  with  a  maximal  deviation  from  the  mean  surrogate  result  of  greater  than  W  (So  et  al.,  1996, 
1997)  (10"^  random  matrices  and  10^  surrogates  were  used).  The  horizontal  dotted  (red)  line  displays  the  maximal  deviation  for  the 
simulation  data.  Since  there  exists  a  significant  fraction  of  surrogates  with  deviation  greater  than  that  for  the  simulation  data  (for 
both  C  -  20%  and  F  -  10%),  neither  plot  displays  convincing  evidence  of  the  existence  of  a  period-1  orbit.  (For  interpretation  of 
the  references  to  color  in  this  figure  legend,  the  reader  is  referred  to  the  web  version  of  this  article.) 


82 


work  has  been  performed  on  changes  in  connectivity  with 
resultant  epileptic  effects  in  hippocampus  (Morgan  and 
Soltesz,  2008;  Dyhrfjeld-Johnsenetal.,  2007).  In  our  results, 
local  increases  or  decreases  in  connectivity  in  the  model  may 
alter  the  network  in  a  similar  manner  (a  tempting  compar- 
ison which  must  be  tempered  by  our  lack  of  understanding 
of  the  functional  significance  of  new  or  absent  connections, 
see  Sallin  et  al.,  1995;  Dinocourt  et  al.,  2003;  Marco  et  al., 
1997;  Dudek  and  Sutula,  2007;  Magloscky,  2010).  In  the 
model  above,  the  external  background  activity  can  be  held 
constant  or  changed  in  a  variety  of  time-frequency  manip- 
ulations. Various  simulations  can  be  created  where  seizures 
occur  rarely  or  very  frequently. 

Similarly,  human  epileptic  seizures  are  episodic,  tran- 
sient events.  Whether  epileptic  seizures  are  random  events 
is  not  clear,  but  times  of  ictal  onset  can  behave  as  a  random 
process  (Suffczynski  et  al.,  2006).  In  some  patients  there 
are  no  identifiable  contributing  factors,  in  other  patients 
such  conditions  such  as  sleep  deprivation  may  increase  the 
chance  of  seizure  occurrence  and  in  still  other  patients 
seizures  can  be  provoked  by  specific  stimuli  (e.g.  hyperven- 
tilation, intermittent  photic  stimulation;  Lu  et  al.,  2008; 
Vinogradova  et  al.,  2009;  Kaplan  et  al.,  2009).  In  these 
examples  there  may  be  resulting  alterations  of  background 
activity  within  and  outside  the  epileptogenic  zone  (not  evi- 
dent from  the  EEG)  that  make  it  more  likely  that  a  seizure 
may  arise  from  the  existing  focal  epileptogenic  network. 
While  these  influences  might  create  a  state  where  seizures 
are  more  likely  to  occur,  this  facilitatory  state  should  be  dis- 
tinguished from  the  presence  (or  absence)  of  a  preictal  state 
in  unprovoked  seizures. 

Random  behavior  of  network 

One  interesting  aspect  of  this  model  is  its  ability  to  demon- 
strate both  spontaneous  periods  of  bursting  activity  as 
well  as  self-termination  of  the  bursting.  As  illustrated  in 
Figs.  2-5,  these  periods  of  bursting  can  be  quite  variable  in 
their  length.  Throughout  the  bursting  and  quiescent  phases, 
the  distributed  background  activity  is  constantly  active, 
affecting  approximately  1%  of  the  total  cells.  We  believe 
this  model  may  represent  the  type  of  network  behavior 
described  by  Lopes  da  Silva  et  al.  (2003),  in  which  epileptic 
activity  within  the  network  cannot  be  predicted  from  the 
interictal  state.  The  non-parametric  and  nonlinear  dynami- 
cal analyses  described  in  the  ''Results"  section  support  this, 
however  inference  from  this  is  somewhat  limited  given  the 
finite  size  of  the  data  set,  the  small  region  of  modeled  area, 
and  the  gamma  distribution  fits  described  earlier. 

This  obviously  may  not  be  true  of  all  types  of  clini- 
cal and  model  epileptiform  behavior.  For  example  Osorio 
et  al.  (2009,  2010)  show  that  pharmacoresistant  seizures 
tend  to  cluster,  and  may  have  an  inherent  self-triggering 
capacity.  This  might  make  a  prediction  algorithm  possible 
to  implement  in  a  useful  fashion.  Others  (Suffczynski  et  al., 
2005,  2006),  however  demonstrate  that  seizure  onset  can 
be  described  in  both  experimental  and  model  data  as  a  ran- 
dom walk  process,  with  possibly  a  deterministic  mechanism 
ascribed  to  seizure  termination.  The  random  onset  nature 
would  (in  a  bistable  network  with  Poisson  transitions)  be 
difficult  to  predict. 


In  the  case  of  our  model  the  underlying  fluctuations  lead- 
ing to  seizure  onset  are  the  random  background  activity  we 
have  imposed  on  the  network.  The  properties  of  the  network 
connectivity  then  support  the  bursting  frequency  observed 
(Anderson  et  al.,  2007).  These  types  of  grossly  synchronized 
bursting  states  in  the  context  of  a  neural  network  have 
been  studied  extensively  by  Kowalski  et  al.  (1992).  They  are 
truly  pathological  in  the  sense  that  they  would  block  or  con- 
found any  information  flow  through  this  network.  It  is  also  a 
network-generated  state,  and  can  be  stopped  by  eliminating 
synaptic  transmission  (Kowalski  et  al.,  1992;  Keefer  et  al., 
2001;  Rhoades  and  Gross,  1994). 

Many  limiting  cases  of  the  gamma  distribution  have  phys- 
ical interpretations  that  might  make  it  easier  to  understand 
spiking  data  from  cortex  when  used  for  fitting  (Papoulis, 
1984;  Suffczynski  et  al.,  2005).  When  the  shape  parameter. 
Of,  in  the  gamma  distribution  is  an  integer,  the  distribution 
is  known  as  an  Erlang  distribution  and  represents  the  prob- 
ability distribution  of  the  waiting  time  until  the  ath  event 
from  a  sampled  Poisson  process  with  characteristic  time  p. 
This  might  be  comparable  to  the  distribution  of  the  number 
of  spiking  or  underlying  synaptic  events  required  to  trigger 
the  network  bursting  behavior.  One  could  envision  trying  to 
extract  the  integer  value  of  a  from  either  computational 
or  experimental  data.  Similarly,  the  Maxwell-Boltzmann 
distribution  can  be  related  to  the  gamma  distribution 
under  certain  restrictions  on  the  gamma  scale  and  shape 
parameters,  implying  the  possibility  of  extracting  almost 
thermal-like  or  statistical  mechanical  interpretations  of  the 
network  activity  (Hegyi,  1996). 


Limitations  of  the  model 

Our  plots  demonstrate  gross  summed  numbers  of  time- 
binned  action  potentials  in  the  model  for  given  neuron 
classes.  The  interictal  activity  is  a  random  Poisson  input  to 
1  %  of  the  cells  in  the  model  and  is  demonstrated  in  the  plots, 
particularly  in  Fig.  2,  Panel  1,  inset.  We  chose  to  impose 
this  random  interictal  behavior  on  the  model  to  demon- 
strate that  these  fluctuations  can  produce  very  coherent 
synchronous  oscillations  in  an  unpredictable  fashion.  How- 
ever, the  surrounding  input  might  not  have  to  be  completely 
random  to  bring  about  the  same  effect.  Epilepsies  involving 
specific  stimuli  might  require  a  coherent  surrounding  input 
to  give  rise  to  the  seizure  (Lu  et  al. ,  2008;  Vinogradova  et  al. , 
2009;  Kaplan  et  al.,  2009).  A  more  realistic  technique  would 
be  to  treat  the  interictal  background  as  a  log-normal  process 
which  does  have  some  support  in  the  literature  (see  Farkhooi 
et  al.,  2009;  Waters  and  Helmchen,  2006).  Newer  recording 
methods  from  invasively  monitored  epilepsy  patients  might 
help  determine  what  patterns  of  background  activity  are 
causative  (Truccolo  et  al.,  2011;  van  Compel  et  al.,  2008). 
Our  intent  was  to  represent  the  resting  interictal  cellular 
activity  as  fundamentally  sparse  with  a  random  component. 
This  was  most  easily  implemented  as  a  low  frequency  Poisson 
process. 

It  is  possible  to  view  the  single  synaptic  input  (driven 
by  the  background  source)  as  representing  several  weaker 
but  synchronized  inputs.  This  represents  a  limitation  to  this 
modeling  approach,  a  limitation  that  in  large  degree  could 
be  corrected  with  more  elaborate  multicompartment  and 
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synaptic  representations  of  the  cells  to  make  the  multiple 
weaker  inputs  more  independent.  Additionally  in  this  lim- 
ited data  set,  we  are  unable  to  say  much  about  time  epochs 
larger  than  several  hundreds  of  seconds  (our  largest  con- 
tinuous simulation  being  320  s).  This  is  clearly  a  limitation 
in  this  technique  and  future  computational  work,  including 
efforts  in  our  laboratory  will  explore  longer  time  intervals 
of  ictal  and  interictal  behavior,  and  possibly  push  the  detec- 
tion limit  for  predictable  activity  lower  (or  detect  it  more 
accurately).  Models  such  as  this  particular  rigid  crystalline 
arrangement  of  neurons  with  many  fixed  cellular  properties 
in  some  sense  have  less  inherent  ''randomness"  than  real 
neocortex.  This  work  was  primarily  meant  to  spark  inter- 
est in  a  possible  mechanism  for  the  difficulties  inherent  in 
seizure  prediction,  but  by  no  means  should  it  be  interpreted 
too  literally. 

Implications  for  seizure  prediction 

The  purpose  of  the  model  presented  here  is  not  to  judge  the 
effectiveness  of  seizure  prediction,  but  rather  to  present  a 
plausible,  alternative  hypothesis  for  partial  seizure  occur- 
rence that  could  explain  situations  where  seizure  prediction 
may  not  be  possible.  Indeed,  it  is  conceptually  attractive  to 
consider  that,  just  as  partial  seizures  may  result  from  vari- 
ous pathologies  and  mechanisms,  that  some  partial  seizures 
may  not  be  reliably  predicted.  It  is  beyond  the  scope  of 
this  discussion  to  address  the  various  methods  being  used 
in  attempts  to  predict  epileptic  seizures.  It  is  always  impor- 
tant to  differentiate  true  seizure  prediction  from  improved 
seizure  detection.  Other  commentaries  and  reviews  address 
these  methods  and  include  discussions  of  the  challenges 
and  frustrations  to  date  in  routine  seizure  prediction  even 
with  intracranial  electrodes  (Estellar  et  al.,  2001;  Litt  and 
Echauz,  2002;  Sackellares  et  al.,  2006;  Haas  et  al.,  2007; 
Osorio  et  al.,  2001). 

This  study  focused  on  the  interval  to  the  time  of  the 
next  ''seizure"  or  busting  phase  in  the  model.  Our  interest 
was  in  the  occurrences  of  the  transitions  from  the  quiescent 
or  background  state  into  the  pathologic  state,  since  that 
is  what  most  seizure  prediction  algorithms  are  optimized 
for.  There  is  fairly  strong  evidence,  certainly  in  the  case  of 
complex  partial  seizures  in  temporal  lobe  epilepsy,  that  the 
length  of  clinical  seizures  can  be  fairly  uniform  in  a  given 
patient  (see  for  instance  Afra  et  al.,  2008).  The  time  inter- 
val durations  of  the  seizures  themselves  may  also  prove  to 
have  to  predictive  guidance  as  well  and  should  be  explored 
in  the  future  in  modeling  efforts.  This  may  be  more  useful 
in  the  case  of  neocortical  epilepsy  with  its  rapid  spread  and 
possible  involvement  of  larger  regions  of  tissue. 

Additionally,  this  model  can  incorporate  incremental 
changes  in  connectivity  in  the  epileptogenic  zone,  changes 
that  could  be  a  model  for  progressive  epileptogenesis  (e.g. 
sprouting).  This  type  of  model  also  provides  data  that  is 
comparable  to  clinical  data  from  epilepsy  patients.  The 
simulated  network  activity  is  taken  from  a  small  region  of 
modeled  cortex  comparable  in  size  to  the  surface  area  under 
a  typical  subdural  grid  electrode,  and  makes  comparisons 
between  modeling  efforts  and  clinical  data  easy  to  perform 
(Anderson  et  al.,  2007,  2009;  Kudela  et  al.,  1997,  2003a, b; 
Franaszczuk  et  al.,  2005).  Indeed  the  major  advantage  of 


neuronal  network  modeling  is  the  ability  to  simultaneously 
monitor  activity  in  all  of  the  network  neurons  under  given 
experimental  conditions,  something  not  possible  with  bio- 
logical systems,  even  with  sophisticated  recording  arrays. 
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Social  living  goes  hand  in  hand  with  communication,  but  the  details  of  this  relationship  are  rarely 
simple.  Complex  communication  may  be  described  by  attributes  as  diverse  as  a  species'  entire 
repertoire,  signallers'  individualistic  signatures,  or  complex  acoustic  phenomena  within  single 
calls.  Similarly,  attributes  of  social  complexity  are  diverse  and  may  include  group  size,  social  role 
diversity,  or  networks  of  interactions  and  relationships.  How  these  different  attributes  of  social 
and  communicative  complexity  co-evolve  is  an  active  question  in  behavioural  ecology.  Sciurid 
rodents  (ground  squirrels,  prairie  dogs  and  marmots)  provide  an  excellent  model  system  for  study- 
ing these  questions.  Sciurid  studies  have  found  that  demographic  role  complexity  predicts  alarm  call 
repertoire  size,  while  social  group  size  predicts  alarm  call  individuality.  Along  with  other  taxa,  sciur- 
ids  reveal  an  important  insight:  different  attributes  of  sociality  are  linked  to  different  attributes  of 
communication.  By  breaking  social  and  communicative  complexity  down  to  different  attributes, 
focused  studies  can  better  untangle  the  underlying  evolutionary  relationships  and  move  us  closer 
to  a  comprehensive  theory  of  how  sociality  and  communication  evolve. 

Keywords:  sociality;  individuality;  repertoire  size;  alarm  call;  information  theory;  vocal  complexity 


1.  WHAT  IS  COMPLEXITY? 

While  most  of  us  have  an  intuitive  idea  of  what  constitu- 
tes complexity,  complexity  is  difficult  to  define  [1]. 
A  system  is  typically  considered  more  complex  if  it  con- 
tains more  parts,  more  variability  or  types  of  parts,  more 
connections  or  types  of  connections  between  parts  or 
more  layers  of  embedded  meaning.  The  information 
[2]  required  to  describe  a  system  can  be  thought  of  as 
a  measure  of  the  system's  complexity  ('Kolmogorov 
complexity',  after  [3]).  To  illustrate,  Dawkins  [4, 
p.  265]  provides  a  simple  thought  experiment:  when 
comparing  two  items  or  systems,  imagine  writing  a 
book  to  describe  each  one.  The  longer  book  will 
describe  the  more  complex  system.  Information-based 
definitions  of  complexity  are  most  useful  because  they 
allow  complexity  to  be  quantified  and  compared  with 
a  single  metric  [5] . 

Social  complexity  can  be  defined  in  numerous  ways 
(e.g.  the  number  of  individuals  [6-8],  the  number  of 
demographic  or  social  roles  [9,10],  the  strength 
of  social  bonds  [11-13],  the  complexity  of  group  sub- 
structure and  relationships  [14-18]  and  combinations 
of  these  [19,20]).  Each  of  these  definitions  can  be  con- 
sidered an  attribute  of  sociality.  What  these  definitions 
have  in  common  is  they  quantify  social  complexity  via 
the  number  or  variability  of  the  social  system's  parts, 
connections  or  layers  of  categorization  (table  1). 
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One  integral  attribute  of  social  complexity  is  per- 
haps the  most  straightforward:  social  group  size.  The 
number  of  individual  animals  present  in  a  social  group 
infiuences  that  group's  resource  needs  [21],  disease 
ecology  [22]  and  predation  risk  [23].  Furthermore, 
social  group  size  defines  boundaries  for  other  attributes 
of  social  complexity,  such  as  mating  systems  or  social 
networks  [24] .  Group  size  also  directly  affects  the  com- 
municative landscape,  since  more  individuals  may  be 
communicated  with,  followed,  discriminated  or  ignored 
[5,25-27].  Social  group  size  is  often  straightforward  to 
quantify.  If  a  social  group's  boundaries  are  well  defined, 
then  a  simple  head  count  is  all  that  is  needed.  For  all 
these  reasons,  it  is  not  surprising  that  social  group  size 
is  one  of  the  most  commonly  studied  attributes  of 
social  complexity  and  has  been  used  in  a  variety  of 
taxa  [7,28-38]. 

Another  critical  attribute  of  social  complexity  is  the 
number  or  variability  of  social  roles  in  a  social  group. 
A  role  can  be  thought  of  as  a  socially  expected  behav- 
ioural pattern.  For  instance,  social  roles  may  be  based 
on  whether  an  individual  is  a  dominant  or  subordinate, 
a  producer  or  scrounger,  or  a  breeder  or  non-breeder. 
Blumstein  &  Armitage  [39]  quantified  social  roles  by 
focusing  on  demographic  (age/sex)  roles  in  social 
groups,  which  can  be  quantified  using  information 
theory.  The  key  assumption  was  that  groups  with 
more  overlapping  generations,  and  those  with  more 
age-sex  classes  present,  were  more  socially  complex 
in  their  demographic  roles. 

Complexity  in  communication  can  be  defined  analo- 
gously to  complexity  in  sociality,  i.e.  via  the  number  or 
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Figure  1.  Alarm  call  repertoires  of  seven  species  of  marmots  {Marmota  spp.).  Sciurid  rodents  differ  in  the  acoustic  structure  of 
their  alarm  calls  and  in  the  size  of  their  alarm  call  repertoires. 


variability  of  the  signal's  or  system's  parts,  connections, 
or  layers  of  meaning  (table  1).  For  acoustic  communi- 
cation, attributes  of  communicative  complexity  can 
include  a  species'  repertoire  size  (number  of  call  types 
[32,39])  or  the  syntactical  complexity  or  uncertainty 
[40,41]  of  sequences  of  calls.  Signals  may  also  be  complex 
in  their  association  w^ith  external  referents,  such  as  func- 
tionally referential  alarm  calls  that  communicate 
predator  type  or  response  strategy,  in  addition  to  alarm 
[42-45].  Communication  of  urgency  or  affective  state  is 
another  form  of  complexity,  providing  additional  infor- 
mation about  the  situation  [46-48].  For  example,  many 
species  of  marmots  alter  the  rate  of  their  alarm  calls 
Wixh  the  urgency  of  the  situation  or  the  degree  of  risk 
[49].  The  presence  of  embedded  signature  information 
(such  as  age,  sex  or  individual  caller  identity  [50-54]) 
also  adds  complexity  to  communicative  signals. 

As  social  complexity,  however  quantified,  increases, 
more  complex  communication  might  be  needed  to 
manage  this  social  complexity  [55-59].  Attributes  of 
social  complexity  may  thus  generate  a  need  for  animals 
to  exhibit  different  types  of  communicative  complexity. 

2.  SCIURIDS  AS  A  MODEL  SYSTEM 

Sciurid  rodents  (Rodentia:  Sciuridae),  particularly 
ground-dwelling  social  species  (tribe  Marmotini, 
including  ground  squirrels  Spermophilus  spp.  and  re- 
lated genera,  prairie  dogs  Cynomys  spp.,  and  marmots 
Marmota  spp.),  present  an  excellent  model  comparative 
system  for  studying  the  relationship  between  social  and 
communicative  complexity,  for  several  reasons. 

First,  ground-dwelling  sciurids  constitute  a  speciose 
clade  that  exhibits  a  wide  range  of  social  structures. 
On  one  end  of  the  spectrum,  some  species  are  nearly 
solitary    or    live    in    simple    family    groups  (e.g. 


woodchucks,  Marmota  monax^  live  in  groups  of  a 
mother  plus  her  young  of  the  year).  Within  the  same 
clade,  more  complicated  group  structures  are  seen, 
such  as  those  in  Eurasian  marmots  (e.g.  M.  caudata^ 
M.  marmota  [60])  in  which  young  delay  dispersal  for 
one,  two  or  more  years,  adult  males  participate  in 
group  life,  and  females  may  aggregate  in  harems  or 
matrilines.  Social  group  size  also  varies  considerably 
across  ground-dwelling  sciurids,  as  does  group  cohe- 
sion. In  addition,  sciurids  also  vary  in  their  tiers  or 
levels  of  social  complexity  [61,62],  with  some  species 
exhibiting  complex  relationships  between  segregated 
social  groups  (e.g.  some  prairie  dogs).  Importantly, 
variation  in  social  group  size  is  not  strictly  tied  to  vari- 
ation in  other  attributes  of  social  structure  [9], 
allowing  these  different  attributes  to  be  teased  apart 
in  comparative  study. 

Secondly,  ground-dwelling  sciurids  offer  a  com- 
munication system  amenable  to  comparative  study. 
Ground-dwelling  sciurids  produce  vocal  alarm  calls 
in  response  to  predatory  stimuli  [63].  These  calls  are 
typically  loud  and  perceptually  salient,  and  they  can 
be  elicited  and  recorded  by  researchers.  Within  the 
sciurid  alarm  call  system,  an  array  of  communicative 
complexity  is  exhibited.  Species  vary  in  the  acoustic 
structure  of  their  calls  [39,54,64],  the  size  of  their 
alarm  call  repertoires  [39,49]  (figure  1),  the  response 
urgency  or  distance  to  predator  encoded  in  their  calls 
[65-67],  syntactic  or  ordering  differences  in  calls 
[64] ,  and  the  degree  to  which  age  and  sex  and  individ- 
ual identity  are  encoded  [54,68].  As  with  social 
complexity,  we  can  view  these  as  attributes  of  commu- 
nicative complexity  and  they  can  be  independently 
studied  in  comparative  analyses. 

Thirdly,  published phylogenies  [69-71]  aidphylogen- 
etically  controlled  evolutionary  analyses  in  this  taxon. 


To  date,  two  phylogenetically  controlled  studies  have 
examined  the  correlated  evolution  of  social  complex- 
ity and  communicative  complexity  in  ground-dwelling 
sciurid  rodents  [39354] .  Importantly,  comparative  studies 
in  sciurids  permit  us  to  decouple  drivers  of  complexity, 
and,  as  discussed  below,  have  revealed  that  social  com- 
plexity, broadly  defined,  does  not  simply  select  for  more 
complex  communication.  Rather,  different  attributes 
of  sociality  seemingly  select  for  specific  attributes  of 
communicative  complexity. 


3.  DEMOGRAPHIC  COMPLEXITY  DRIVES 
REPERTOIRE  SIZE  EVOLUTION 

Blumstein  &  Armitage  [39]  used  22  species  of  ground- 
dwelling  sciurids  to  test  for  a  relationship  between 
one  attribute  of  social  complexity  (demographic  role 
complexity)  and  one  attribute  of  communicative  com- 
plexity (alarm  call  repertoire  size).  They  defined  a 
social  complexity  index  via  information  theory,  using 
data  on  dispersal  patterns,  the  age/sex  composition  of 
groups  and  variability  in  these  traits  [39] .  The  resulting 
numbers,  expressed  in  bits,  quantify  the  amount  of 
information  needed  to  describe  the  demographic  role 
complexity  in  different  species. 

For  species  with  multiple  alarm  call  types,  different 
call  types  may  be  used  to  communicate  about  different 
types  of  predators  [42,43,45],  to  communicate  different 
degrees  of  risk  or  response  urgency  [46,48],  or  to  com- 
municate both  predator  type  and  urgency  [44,72]. 
Species  living  in  more  complex  social  groups,  such  as 
groups  with  more  social  demographic  roles,  may  have 
greater  need  to  signal  alarm  in  a  more  complex 
manner  and  thus  may  use  larger  alarm  call  repertoires. 
Across  ground-dwelling  sciurids,  alarm  call  repertoire 
size  varies  from  one  to  five  (see  examples  in  figure  1). 
Blumstein  &  Armitage  [39]  used  a  combination  of 
field  recordings  and  literature  review  to  establish  the 
alarm  call  repertoire  size  for  22  species  of  sciurid  and 
regressed  this  against  the  social  complexity  index. 
Demographic  role  complexity  significantly  explained 
variation  in  alarm  call  repertoire  size,  both  in  analyses 
of  raw  data  and  independent  contrasts  (figure  2). 

More  up-to-date  phylogenies  have  since  been  pub- 
lished, so  we  re-ran  the  independent  contrasts  analysis 
using  more  recent  phylogeny  taken  from  Herron  et  al. 
[70]  and  Steppan  et  al.  [71].  The  relationship  between 
social  demographic  complexity  and  alarm  call  repertoire 
size  remained  significant  (r^  =  0.228,  y  =  1.049x,  p  = 
0.025,  fz  =  21  independent  contrasts).  An  increase  in 
social  demographic  roles  appears  to  drive  the  ability  to 
communicate  via  a  larger,  more  complex  repertoire  of 
alarm  calls.  The  functional  explanation  underlying  this 
correlation  is  unknown.  Future  study  of  alarm  call 
type  use  in  different  contexts  or  between  different 
caller- receiver  dyads  may  be  necessary  to  untangle 
how  more  demographically  complex  species  may  benefit 
from  a  more  diverse  alarm  call  repertoire. 


4.  SOCIAL  GROUP  SIZE  DRIVES  THE  EVOLUTION 
OF  INDIVIDUAL  SIGNATURES 

Pollard  &  Blumstein  [54]  compared  sciurid  vocal  signa- 
ture individuality  (individual  distinctiveness)  against 
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Figure  2.  Relationship  between  social  demographic  complex- 
ity and  alarm  call  repertoire  size  across  22  species  of  sciurid 
rodents,  in  raw  (a)  and  independent  contrasts  {Jo)  data. 
Social  complexity  (variability  in  demographic  roles)  correlates 
with  alarm  call  repertoire  size,  (a)  Adapted  from  Blumstein  & 
Armitage  [39] ,  with  permission  from  Chicago  Journals. 
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Figure  4.  Possible  functional  relationships  between  attributes  of  social  complexity  and  attributes  of  communicative  complexity. 
For  specific  examples  of  relationships  that  are  supported  in  the  literature,  see  table  2. 


social  group  size,  Blumstein  &  Armitage's  [39]  demo- 
graphic complexity  index,  and  two  other  indices  of 
social  structure  complexity  [61,62].  Group  size  was 
not  correlated  with  these  other  complexity  indices 
[54].  Because  studies  quantifying  vocal  individuality 
are  not  common,  this  analysis  required  novel  data,  and 
eight  species  of  ground-dwelling  sciurid  were  thus  cap- 
tured, individually  marked  and  recorded  on  multiple 
occasions.  Individuality  was  calculated  from  acoustic 
traits  using  an  information-theory  metric  developed 
by  Beecher  [5,73]  and  used  in  previous  studies 
[25,27,52,74].  The  individuality  information  statistic 
quantifies,  in  bits,  the  amount  of  individually  specific 
information  content  present  in  a  species'  vocalizations. 
This  statistic  was  used  as  the  quantitative  metric  of 
one  attribute  of  communicative  complexity. 

Individual  signatures  in  alarm  calls  are  important 
for  animals  such  as  sciurids  [75-77].  Individuals 
vary  in  their  alarm  signal  reliability,  and  listeners 
benefit  when  they  recognize  the  individual  identity  of 
an  alarm  caller,  as  this  allows  them  to  better  calibrate 
their  behavioural  response  [76-80].  Ground-dwelling 
sciurids  live  in  groups  of  closely  related  kin,  such 
that  the  signaller  and  receiver  are  often  close  relatives. 
This  may  allow  callers  to  benefit  via  kin  selection  by 
aiding  related  receivers  in  individual  discrimination 
and  reliability  assessment  [54,77],  although  they  may 
also  benefit  via  reciprocal  altruism  [81]. 

Individuality  is  expected  to  evolve  with  group  size 
[5,25-27,52,54,73].  As  social  group  size  increases, 
the  number  of  individuals  that  must  be  discriminated 
increases  accordingly,  making  individual  recognition 
tasks  more  difficult.  Increased  individuality  would  be 
necessary  to  permit  successful  discrimination  of  all 
the  individuals  in  the  group.  Since  other  attributes  of 


sociality  affect  communicative  complexity  [39,82], 
these  other  attributes  may  also  influence  individuality. 

Pollard  &  Blumstein  [54]  gathered  typical  social  group 
size  data  from  the  literature  and  regressed  this  against  the 
individuality  metric.  Group  size  explained  considerable 
variation  in  vocal  individuality,  both  in  the  raw  data 
and  in  an  analysis  using  independent  contrasts 
(figure  3).  As  predicted,  social  group  size  appears  to 
drive  the  evolution  of  individual  signature  information. 

Interestingly,  it  is  group  size,  not  other  social 
complexity  attributes,  that  seems  responsible  for  the 
evolution  of  individually  specific  vocalizations.  Signa- 
ture information  was  also  regressed  against  three  other 
measures  of  social  structure  complexity,  including 
Blumstein  &  Armitage's  [39]  demographic  complexity 
index,  Michener's  [62]  social  grade,  and  Armitage's 
[61]  sociality  index.  'Social  grade'  and  'sociality  index' 
are  somewhat  subjective  numerical  assignments 
intended  to  quantify  the  complexity  of  each  species' 
social  structure  and  mating  system.  Individuality  was 
not  significantly  related  to  any  of  these  three  other 
social  complexity  attributes,  even  after  controlling  for 
variation  in  group  size. 

5.  ATTRIBUTES  OF  SOCIALITY,  ATTRIBUTES 
OF  COMMUNICATION 

Results  from  sciurid  studies  highlight  an  important 
caveat  about  the  evolution  of  social  and  communicative 
complexity.  Both  sociaUty  and  communication  have 
multiple  attributes  (table  1),  and  these  attributes  affect 
one  another  in  complex  ways  (figure  4).  The  main 
insight  from  our  integrative  overview  is  that  different 
attributes  of  social  complexity  are  likely  to  drive  differ- 
ent attributes  of  communicative  complexity.  Thus,  it  is 
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important  to  think  clearly  about  the  nature  of  the 
relationship  because  different  attributes  work  different 
ways  (see  also  [1]).  Studies  from  other  taxa  also  support 
taking  a  nuanced  look  at  the  correlated  evolution 
of  social  and  communicative  complexity  (table  2) .  For 
example,  the  complexity  of  reproductive  roles  was 
found  to  influence  individuality  in  wasp  facial  markings 
[82],  while  nesting  or  roosting  colony  size  was  found  to 
influence  individuality  in  avian  and  chiropteran  contact 
calls  [25-27].  In  phocid  seals,  underwater  repertoire 
size  increased  with  mating  system  complexity  [88].  In 
primates,  social  group  size  and  time  spent  in  social 
grooming  influenced  vocalization  repertoire  size  [32], 
while  group  size  influenced  ability  to  produce  varied 
facial  expressions  [35].  Furthermore,  social  complexity 
may  influence  not  just  the  complexity  of  communica- 
tive signals  themselves  but  also  the  perceptual  capacity 
to  receive  such  signals,  e.g.  social  group  size  predicts 
auditory  sensitivity  in  lemurs  [84] .  Social  and  commu- 
nicative complexity  can  also  covary  within  species  or 
on  short  time  scales;  for  example,  chickadees  placed  in 
larger  groups  displayed  more  variability  in  their  call 
types  and  combinations  [40] .  In  all  these  cases,  a  differ- 
ent attribute  of  social  complexity  is  associated  with  a 
different  attribute  of  communicative  complexity. 

As  a  general  rule,  we  would  predict  that  the  types  of 
communicative  complexity  that  are  most  relevant  to  a 
speciflc  aspect  of  sociality  would  be  evolving  with 
those  aspects  of  sociality.  For  example,  if  individual 
recognition  is  important,  and  if  all  members  of  a 
group  must  be  recognized,  we  would  predict  a  speciflc 
type  of  communicative  complexity  (individualistic  sig- 
natures) to  evolve  with  a  particular  attribute  of  social 
complexity  (group  size).  This  has  been  found  in  birds, 
bats  and  sciurids  [25-27,54] .  If  calls  are  used  to  attract 
or  impress  mates,  we  would  predict  speciflc  attributes  of 
communicative  complexity  (such  as  syntactical  com- 
plexity or  repertoire  size)  to  evolve  with  a  speciflc 
attribute  of  social  complexity  (mating  system) .  Indeed, 
male  repertoire  size  seems  to  have  evolved  with  mating 
system  in  phocid  seals  [88].  However,  these  relation- 
ships are  not  always  obvious,  and  attributes  of  sociality 
may  influence  communication  in  diverse  and  compli- 
cated ways  (flgure  4) .  Future  studies  in  more  taxa  will 
be  necessary  to  comprehensively  identify  the  many 
ways  in  which  distinct  attributes  of  social  and  communi- 
cative complexity  are  evolutionarily  or  functionally 
linked.  With  the  discovery  that  social  network  statistics 
can  be  used  by  behavioural  biologists  [18,91,92], 
there  are  many  precisely  deflned  attributes  of  sociality 
that  could  be  studied.  From  a  comparative  perspective, 
available  data  are  often  a  limiting  factor,  but  over  time, 
there  will  be  more  data  from  different  species  available 
with  which  to  study  the  evolution  of  social  and  com- 
municative attributes  [93].  As  these  databases  are 
developed,  sciurid  rodents  can  continue  to  play  a 
valuable  role  in  this  pursuit,  because  these  species  vary 
in  several  attributes  of  social  and  communicative 
complexity,  and  systematic  study  can  help  identify 
which  of  these  attributes  covary.  These  attributes  can 
be  further  examined  with  respect  to  different  environ- 
mental conditions  in  sciurids'  diverse  habitats,  and/or 
mapped  onto  available  phylogenies  to  estimate  when 
and  how  complexity  evolved. 


6.  CONCLUSIONS 

The  relationship  between  social  complexity  and  com- 
municative complexity  is  multi-faceted.  Different 
attributes  of  sociality  may  drive  the  evolution  of  differ- 
ent attributes  of  communication.  For  example,  in 
sciurid  rodents,  social  demographic  complexity 
explains  the  evolution  of  alarm  call  repertoire  size, 
while  social  group  size  explains  the  evolution  of 
alarm  call  vocal  individuality.  Sciurids  are  one  of 
many  excellent  model  systems  in  which  to  explore 
the  evolution  of  social  and  communicative  complexity, 
and  future  work  in  sciurid  and  non-sciurid  taxa  will 
help  build  a  comparative  database  in  which  these  ques- 
tions can  be  thoroughly  examined.  The  ever-growing 
comparative  database  will  ultimately  help  us  develop 
a  comprehensive  understanding  of  how  social 
complexity  and  communicative  complexity  evolve. 
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Abstract 

This  article  presents  the  integration  of  brain  injury  biomechanics  and  graph  theoretical  analysis  of  neuronal  connections,  or 
connectomics,  to  form  a  neurocomputational  model  that  captures  spatiotemporal  characteristics  of  trauma.  We  relate 
localized  mechanical  brain  damage  predicted  from  biofidelic  finite  element  simulations  of  the  human  head  subjected  to 
impact  with  degradation  in  the  structural  connectome  for  a  single  individual.  The  finite  element  model  incorporates  various 
length  scales  into  the  full  head  simulations  by  including  anisotropic  constitutive  laws  informed  by  diffusion  tensor  imaging. 
Coupling  between  the  finite  element  analysis  and  network-based  tools  is  established  through  experimentally-based  cellular 
injury  thresholds  for  white  matter  regions.  Once  edges  are  degraded,  graph  theoretical  measures  are  computed  on  the 
"damaged"  network.  For  a  frontal  impact,  the  simulations  predict  that  the  temporal  and  occipital  regions  undergo  the  most 
axonal  strain  and  strain  rate  at  short  times  (less  than  24  hrs),  which  leads  to  cellular  death  initiation,  which  results  in  damage 
that  shows  dependence  on  angle  of  impact  and  underlying  microstructure  of  brain  tissue.  The  monotonic  cellular  death 
relationships  predict  a  spatiotemporal  change  of  structural  damage.  Interestingly,  at  96  hrs  post-impact,  computations 
predict  no  network  nodes  were  completely  disconnected  from  the  network,  despite  significant  damage  to  network  edges. 
At  early  times  (/<24hrs)  network  measures  of  global  and  local  efficiency  were  degraded  little;  however,  as  time  increased 
to  96  hrs  the  network  properties  were  significantly  reduced.  In  the  future,  this  computational  framework  could  help  inform 
functional  networks  from  physics-based  structural  brain  biomechanics  to  obtain  not  only  a  biomechanics-based 
understanding  of  injury,  but  also  neurophysiological  insight. 
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Introduction 

The  finite  element  method  is  often  used  to  study  neurotrauma 
[1-9]  and  continues  to  emerge  as  a  useful  tool  in  the  field  of 
neuroscience  [10-13].  Models  continue  to  advance  in  biofidelity 
by  incorporating  an  increased  level  of  anatomic  detail  [4,13,14], 
improved  representation  of  the  material  behavior  at  various 
loading  rates  [15-20],  and  advanced  measures  and  predictions  of 
injury  [8,21,22].  Finite  element  models  are  commonly  used  to 
understand  the  biomechanics  of  brain  and  skull  deformation  when 
the  head  is  subjected  to  insult,  leading  to  improved  insight  into 
mechanisms  of  acute  injury.  For  example,  modeling  axonal  injury 
mechanisms  within  white  matter  of  the  brain  has  been  the  focus  of 
some  recent  efforts  and  provides  a  means  to  relate  an  insult  to  a 
cellular  injury  mechanism  [8].  By  using  diffusion  tensor  imaging 
(DTI)  fiber  tractography,  the  structural  orientation  of  neuronal 
axonal  bundles  can  be  incorporated  into  the  finite  element  model 
and  can  be  used  to  compute  the  axonal  strain  during  brain  white 
matter  deformation  [8,23].  Then,  by  using  an  axonal  injury 
threshold,   the   occurrence   of  diffuse   axonal  injury  (DAI)  is 


predicted  [8].  Wright  and  Ramesh  [8]  show  that  the  degree  of 
injury  predicted  is  highly  dependent  on  the  incorporation  of  the 
axonal  orientation  information  and  the  inclusion  of  material 
anisotropy  into  the  constitutive  model  for  white  matter.  By 
modeling  the  underlying  mechanism  of  DAI,  an  enhanced 
understanding  of  the  neurotrauma  is  attained  through  a  spatio- 
temporal  description  of  tissue  deformation.  In  general,  finite 
element  simulations  may  help  to  elucidate  the  injury  mechanisms 
of  neurotrauma. 

As  finite  element  models  advance,  experimentally  based  models 
of  neurotrauma  also  continue  to  become  more  sophisticated, 
ranging  from  the  macroscopic  [24,25]  to  the  cellular  level  [26-29]. 
Various  biomechanical  and  physiological  injury  thresholds  for 
neurotrauma  have  been  proposed  in  the  past,  including  intracra- 
nial pressure  [21]  and  strain  [8,30,31].  While  these  thresholds  offer 
an  immediate  prediction  of  injury,  they  lack  a  long-term 
description  of  functional  degradation.  A  time-evolving  injury 
model  is  attractive  since  the  biological  response  occurs  on  a  slower 
time  scale  than  an  injurious  stimulus  as  a  consequence  of 
mechanotransduction  cascades  [27].  There  have  been  significant 
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Author  Summary 

According  to  the  Centers  for  Disease  Control  and 
Prevention  in  the  United  States,  approximately  1.7  million 
people,  on  average,  sustain  a  traumatic  brain  injury 
annually.  During  the  last  few  decades,  brain  neurotrauma 
biomechanics  has  been  an  active  area  of  research 
involving  medical  clinicians  and  a  broad  range  of  scientists 
and  engineers.  In  addition,  advances  and  fast  growth  of 
human  connectomics  continues  to  reveal  new  insights  into 
the  damaged  brain.  With  recent  advances  in  computa- 
tional methods  and  high  performance  computing,  we  see 
the  need  and  the  exciting  possibility  to  merge  brain 
neurotrauma  biomechanics  and  human  connectomics 
science  to  form  a  new  area  of  investigation  -  connectome 
neurotrauma  mechanics.  For  neurotrauma,  the  idea  is 
simple  -  inform  human  structural  connectome  analysis 
using  physics-based  predictions  of  biomechanical  brain 
injury.  If  successful,  this  technique  may  be  further  used  to 
inform  human  functional  connectome  analysis,  thus 
providing  a  new  tool  to  help  understand  the  pathophys- 
iology of  mild  traumatic  brain  injury. 

efforts  to  develop  empirically  based  time-evolving  cellular  injury 
thresholds,  which  many  times  use  in  vitro  cellular  and  tissue 
culture  models.  Morrison  III  et  al.  [27]  suggest  that  for  brain 
biomechanics,  neuronal  culture  models  that  accurately  mimic 
specific  brain  features  can  be  used  to  explore  tissue  properties  and 
tolerances  or  thresholds  to  mechanical  loading.  Cell  death  has 
been  primarily  used  as  a  definition  of  injury  within  the  neuronal 
culture  model  community  and  has  been  applied  to  determine 
tissue-level  tolerance  criteria  using  local  values  of  axonal  strain, 
strain  rate,  and  time  from  "insult"  [32,33].  Furthermore,  as 
Morrison  III  et  al.  [27]  point  out,  empirical  functions  for  cellular 
death  based  on  culture  models  could  be  incorporated  into  finite 
element  analyses,  thereby  enabling  biological  predictions  to 
supplement  mechanical  predictions  of  local  tissue  stress  and  strain. 
In  the  study  presented  here,  this  concept  is  further  explored  and 
used  as  a  bridge  to  a  network-based  analysis  of  the  brain. 

As  physics-based  models  become  more  capable  of  predicting 
tissue-level  injury  mechanisms  from  improved  computational  and 
experimental  resources  for  biomechanics,  there  remains  a  need  to 
understand  how  structural  damage  in  a  given  location  of  the  brain 
evolves,  and  how  it  may  influence  functional  or  cognitive 
performance  over  time.  Such  a  goal  is  complex  and  difficult.  For 
example,  as  Kaiser  et  al.  point  out  [34],  in  some  instances  the 
brain  can  be  robust  to  physical  damage,  and  in  other  instances 
physical  damage  can  cause  severe  functional  deficits.  Nevertheless, 
Kaiser  et  al.  [34]  pursue  the  important  and  unanswered  question: 
Are  the  severity  and  nature  of  the  effects  of  localized  damage 
predictable?  To  help  explore  the  answer  to  this  question,  tools  are 
being  developed  for  the  quantitative  analysis  of  brain  network 
organization,  based  largely  on  graph  theory  [35,36].  Typically, 
network  nodes  represent  brain  regions,  often  obtained  from  high 
resolution  magnetic  resonance  imaging  (MRI).  The  network  links 
or  edges  between  brain  regions  represent  interregional  pathways 
that  convey  neuronal  signals  and  are  commonly  obtained  from 
non-invasive  DTI  or  diffusion  spectrum  imaging  (DSI)  [37,38]. 
The  connection  matrix  of  the  network  of  the  human  brain  forms 
the  so-called  "human  connectome"  [35,36,39,40].  Recently,  Jirsa 
et  al.  [41]  used  connectomics  to  establish  a  framework  for  a 
"virtual  brain",  in  which  network  modeling  is  used  to  understand 
the  intact  and  damaged  brain.  Similar  to  previous  work, 
hypothetical  or  random  deletion  of  nodes  or  edges  were  used  to 


degrade  the  structural  connectome  [42] .  More  recently,  structural 
and  diffusion  images  from  14  healthy  subjects  were  used  to  create 
spatially  unbiased  white  matter  connectivity  importance  maps  that 
quantify  the  amount  of  disruption  to  the  overall  brain  network  that 
would  be  incurred  if  that  region  were  compromised  [43] .  In  this 
study,  we  attempt  to  extend  the  capabilities  of  neurocomputational 
models  by  providing  a  physics-based  approach  for  predicting 
degraded  regions  of  interest.  Physics-based  injury  predictions  may 
help  inform  structural  connectome  analysis. 

In  this  study,  neurotrauma  is  investigated  by  using  finite  element 
simulations  of  a  single  individual  subjected  to  a  simulated  head 
impact.  Tissue  damage  is  computed  using  empirically  based 
damage  models  that  provide  a  link  from  macroscopic  biomechan- 
ical deformation  to  mesoscopic  damage.  Axonal  bundle  tracts  are 
explicitly  modeled  using  a  multiscale  description  of  white  matter 
tracts  obtained  from  diffusion  tensor  imaging.  Then,  using  the 
physics-based  injury  predictions  for  white  matter  tissue  from  finite 
element  simulations,  the  structural  brain  connectivity  or  con- 
nectome is  degraded,  and  various  network  measures  are  comput- 
ed. This  is  an  important  contribution  because  finite  element 
simulation  predictions  of  tissue  damage  provide  physics-based 
reasoning  for  removing  nodes  or  degrading  edges  to  create  the 
"damaged"  brain  connection  matrix.  In  turn,  this  approach  may 
provide  further  insight  into  mild  traumatic  brain  injury  by 
shedding  light  on  the  relationship  between  mechanical  stimulus 
to  the  brain  and  neurobiological  processes  that  result.  Further- 
more, if  successful,  the  computational  framework  presented  herein 
could  supplement  ongoing  efforts  to  evaluate  the  use  of  non- 
invasive medical  imaging  tools,  such  as  diffusion  tensor  and 
spectrum  imaging,  to  detect  white  matter  disruption  for  neuro- 
trauma diagnostics  [44,45]  by  providing  a  time-evolving  history  of 
tissue  injury. 

Methods 

A  suite  of  medical  imaging  and  software  tools  are  used  to  obtain 
an  individual- specific  finite  element  model  and  structural  con- 
nectome-based  analysis.  The  overarching  process  is  schematically 
shown  in  Figure  1  and  is  outlined  below. 

Individual  Specific  Models 

Tl  and  diffusion  tensor  magnetic  resonance  images  are  taken 
from  a  single  individual  (the  corresponding  data  can  be  found  in 
[37]).  The  Tl  image  (Figure  la)  is  segmented  into  different  head 
materials  (Figure  lb)  using  the  software  Amira  [46]  and  the 
Connectome  Mapper  Toolkit  [47].  The  segmented  geometry  is 
then  used  to  create  a  biofidelic  three-dimensional  finite  element 
volume  mesh  (Figure  Ic).  In  order  to  create  a  corresponding 
structural  connectome  or  network,  the  T 1  image  is  parcellated  into 
83  regions  of  interest  (ROI)  representing  the  location  of 
anatomical  regions  of  the  brain  based  on  the  Desikan-Killiany 
atlas  extended  to  include  subcortical  regions  [48].  Since  diffusion 
tensor  images  (Figure  1  e)  are  used  to  generate  axonal  bundle  fiber 
tractography  based  on  the  direction  of  peak  water  diffusion  in  each 
voxel  [49],  the  DTI  fiber  tractography  (Figure  If)  represents  the 
approximate  location  of  neuronal  axonal  bundles  [50] .  Fibers  are 
filtered  for  connectome  creation  to  include  only  fibers  that  begin 
and  end  within  ROI.  The  structural  connectome  is  assembled 
using  the  Connectome  Mapper  Toolkit  [47]  and  is  composed  of 
nodes  representing  ROI  generated  at  the  centroid,  connected  by 
edges  that  represent  structural  pathways  for  which  the  DTI 
tractography  traverses.  Following  the  segmentation  enabled 
by  the  Desikan-Killiany  brain  atlas,  there  are  83  network  nodes 
and  1029  network  edges.  Furthermore,  DTI  fiber  tractography  is 
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Figure  1.  A  flowchart  of  the  process  for  creating  the  finite  element  model  and  connectome  from  T1  and  diffusion  MR!,  (a)  The  T1 

image  is  (b)  segmented  into  different  head  materials.  The  segmented  geometry  is  then  used  to  create  (c)  a  biofidelic  three-dimensional  finite  element 
volume  mesh.  The  mesh  is  required  for  (d)  an  explicit  dynamic  finite  element  simulation  that  captures  the  biomechanical  response  from  frontal 
impact,  (e)  Diffusion  tensor  images  are  used  to  generate  (f)  axonal  bundle  fiber  tractography  which  is  used  to  inform  finite  element  transversely 
isotropic  constitutive  descriptions  of  white  matter  tissue  behavior  (c).  Tractography  is  also  used  to  create  (g)  a  network  model  of  the  brain  that  can  be 
(h)  degraded  over  time. 
doi:1 0.1 371/journal.pcbi.1 00261 9.g001 
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incorporated  into  the  finite  element  model  by  using  a  transverse 
isotropic  material  model  specifically  developed  for  representing 
white  matter  tissue  [23,51]  where  each  finite  element  within  white 
matter  regions  is  assigned  an  orientation  based  on  the  superpo- 
sition of  fiber  tractography  [52].  Details  of  the  numerical 
implementation  are  published  elsewhere  [52],  but  it  is  important 
to  point  out  that  various  different  cases  needed  to  be  considered 
while  assigning  one  orientation  per  finite  element.  For  example,  in 
the  case  of  multiple  fibers  that  overlap  the  spatial  bounds  of  a 
single  element,  the  multiple  fiber  orientations  were  averaged. 

Damage  and  Injury  Thresholds  for  Coupling  FEM  Results 
to  the  Connectome  Analysis 

Herein,  finite  element  simulations  of  the  human  head  are 
designed  to  mimic  experimental  conditions  for  cadaveric  impact 
tests,  which  are  conducted  to  understand  the  dynamics  of  a  frontal 
impact  and  the  associated  compression-tension  damage  [53].  In 
this  study,  the  explicit  dynamic  finite  element  method  [54]  is  used 
to  capture  the  transient  response  of  head  impact.  Supplementary 
information  about  the  finite  element  method  is  included  in  the 
supporting  information.  Text  S 1 .  In  addition  to  a  volume  mesh  for 
the  head,  the  finite  element  model  also  requires  material 
constitutive  descriptions  and  properties  for  all  its  components: 
skull,  cortex,  brain  stem,  cerebrospinal  fluid,  and  soft  tissue,  which 
represents  a  homogenized  mixture  of  muscle  and  skin.  A  detailed 
list  of  the  material  constitutive  laws  used  to  relate  tissue  strain  to 
stress  is  included  in  the  supporting  information.  Text  S2.  Once 
again,  we  point  out  that  diffusion  tensor  imaging  is  used  to  inform 
the  constitutive  model  for  white  matter  tissue,  which  has  been 
applied  in  the  past  for  studying  non-human  primates  [31]  and 
human  injury  [8,22,55].  The  entire  finite  element  model  consisted 
of  1,394,945  tetrahedral  elements  and  237,115  finite  element 
nodes.  For  boundary  conditions,  the  bottom  of  the  neck  is  fixed, 
and  a  force  was  applied  to  a  circular  area  on  the  forehead  (about 
3  mm^)  in  the  anteroposterior  direction.  The  input  force-time 
curve  was  a  sinusoidal  shape  with  a  peak  force  of  7,000  N  at 
2.75  ms.  The  finite  element  simulation  computes  the  time-evolving 
mechanical  strain  and  stress  in  the  direction  of  axonal  fiber 
bundles.  The  strain  in  the  direction  of  axonal  bundles  is  referred  to 
as  the  axonal  strain. 

One  limitation  of  the  current  finite  element  model  is  the 
exclusion  of  viscoelasticity  in  the  constitutive  description  of  brain 
matter.  The  authors  acknowledge  that  to  accurately  model  the 
progression  of  damage,  the  constitutive  model  should  be  extended 
to  account  for  the  time-dependent  behavior  of  brain  tissue.  The 
exclusion  may  have  an  effect  on  the  outcome  of  our  results,  leading 
to  larger  shear  stresses,  but  smaller  shear  strains,  thus,  less 
predicted  damage.  For  example,  Chafi  et  al.  [56]  shows  that 
viscoelasticity  plays  a  major  role  in  the  dynamic  response  of  the 
brain  under  blast  loading.  Future  efforts  are  focused  on  improving 
the  mechanical  description  of  brain  tissue. 

In  order  to  model  damage  using  a  physics-based  approach, 
either  an  explicit  failure  mechanism  should  be  modeled  or  an 
empirically  based  failure  threshold  is  required.  For  this  study, 
measures  of  axonal  strain  and  strain  rate  computed  for  white 
matter  regions  are  used  as  input  for  empirically  based  injury 
threshold  predictions  that  are  obtained  from  cellular  culture 
experiments.  Specifically,  experimental  results  for  cellular  death 
are  described  using  a  mathematical  function  for  tolerance  criteria 
that  relates  strain  to  resultant  cell  death  evaluated  for  up  to  four 
days  post-injury  [27,32,33].  Experimental  data  exists  for  the  CAl 
and  CA3  regions  of  the  hippocampus,  dentate  gyrus,  and  cortex 
for  the  rat.  Experiments  suggest  that  some  regions  of  the  rat  brain 
are  sensitive  to  loading  rate,  while  other  brain  regions  are  not. 


That  is,  Elkin  et  al.  [33]  found  that  cortical  cell  death  was 
dependent  on  applied  strain  rate,  whereas  hippocampal  cell  death 
was  not.  The  relations  used  in  the  present  model,  obtained  from 
Morrison  III  et  al.  [27],  are: 

i)c.i/c.3  =  0.0389    [e   °-'^«]    [t  (1) 
Z)z,G  =  0.0323    [e  [t    1-8209]  (2) 

Z)c„«„  =  0.094    [e    '-"^^j     [t  [s    O-i^S]  (3) 

where  t  is  time  from  insult,  e  is  the  local  strain,  and  s  is  the  local 
strain  rate  [27].  Similar  to  Morrison  III  et  al.  [27],  the  units  of 
time  are  days,  strain  is  dimensionless  and  strain  rate  is  inverse 
seconds.  The  damage  parameter,  D,  is  defined  as  the  percent  area 
of  cell  death.  Using  Equations  1-3,  the  axonal  strain  and  strain 
rate  from  the  finite  element  simulations,  as  well  as  time,  are  used  to 
calculate  percent  cell  death  for  a  tissue  region.  Due  to  insufficient 
resolution  of  the  MRI  data  used  in  this  study,  segmentation  of  the 
hippocampus  into  CAl,  CA2  and  dentate  gyrus  regions  was  not 
possible,  thus  the  more  conservative  equation  for  Dj)q  was  used  to 
compute  cell  death  for  the  entire  hippocampus.  Equations  1-3  are 
monotonic  functions  that  only  increase  with  time,  thus  cellular 
repair  mechanisms  and  regeneration  are  not  currently  captured. 
Potential  issues  with  monotonic  cellular  death  predictions,  as  well 
as  using  rat  brain  injury  thresholds  instead  of  human  cellular 
injury  thresholds,  will  be  discussed  later.  Since  the  explicit 
dynamic  finite  element  method  is  used,  the  transient  wave 
propagation  for  solid  mechanics  is  resolved;  however,  this  is 
computationally  costly  and  limits  the  total  time  of  biomechanical 
prediction.  The  explicit  dynamic  simulation  runs  to  15  ms,  thus 
the  "long-term"  structural  mechanics  of  brain  swelling,  relaxation, 
etc.,  are  not  captured  in  the  current  model,  although  could  be 
adapted  in  the  future  by  using  quasi-static,  implicit  finite  element 
solvers. 

The  reader  should  understand  that  we  use  the  local  tissue  strain 
and  strain  rates  predicted  from  finite  element  simulations  of  short 
duration,  about  15  ms,  as  input  to  experimentally  based  cellular 
death  models  that  were  developed  over  a  96  hour  period.  This 
assumes  that  the  tissue  damage  due  to  large  deformations  occurs 
immediately  and  initiates  an  injury  process  that  grows  with  time. 
This  assumption  is  based  on  the  observation  that  cell  death  was 
not  immediate  in  response  to  deformation,  but  instead,  increased 
over  4  days  after  injury  [27]. 

It  should  be  noted  that  the  cellular  death  estimates  that 
Morrison  III  et  al.  [27]  developed  were  not  based  on  axonal  strain 
but  instead  on  nominal  strain  applied  to  the  back  of  the  substrate 
on  which  the  neuronal  cells  were  attached.  Thus,  Equations  1-3 
may  not  accurately  describe  the  actual  relationship  between 
axonal  strain  and  injury.  Furthermore,  the  neuronal  cell  bodies  in 
the  experiments  were  not  aligned  in  a  specific  orientation,  so  the 
response  is  not  strictly  representative  of  axonal  strain  injury.  There 
are  research  efforts  examining  stretching  of  individual  axons  but 
have  not  proposed  empirical  relationships  for  cellular  death  in 
terms  of  applied  strain,  rate  of  loading  and  time  from  insult 
[57,58].  Experimental  measurements  of  the  cellular  response  of 
white  matter,  especially  axonal  bundles,  would  be  interesting  to 
explore  and  could  provide  a  more  accurate  representation  of  cell 
damage  in  the  future. 

Furthermore,  the  experiments  performed  by  Morrison  III  et  al. 
[27]  used  the  rat  hippocampus,  which  is  mostly  comprised  of  gray 
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matter.  Thus,  it  should  be  noted  that  the  empirical  data  described 
in  Equations  1-3  was  not  intended  to  describe  white  matter  injury 
response  so  there  may  be  limitations  in  applying  Equations  1-3  to 
predict  cellular  white  matter  injury.  Cellular  injury  threshold  data 
for  isolated  white  matter  is  currently  limited;  however  there  have 
been  some  efforts  to  characterize  mechanical  damage  to  axons 
[57-59].  We  hope  a  computational  framework  described  here  will 
help  to  motivate  efforts  to  obtain  different  white  matter  and  gray 
matter  empirical  functions,  which  could  then  be  used  for  each  of 
region  of  the  brain  separately.  As  mentioned  before,  the 
application  of  this  approach  raises  the  possible  need  for 
understanding  properties  of  white  matter  fiber  bundles. 

In  order  to  map  the  finite  element  results  to  network-based 
analysis  tools,  output  data  from  each  finite  element  that  represents 
white  matter  is  mapped  to  a  corresponding  voxel  in  the  MRI  data 
that  is  used  to  create  the  DTI  tractography.  This  mapping  is 
referred  to  as  the  element-to-voxel  map.  Multiple  finite  elements 
within  a  single  voxel  are  averaged.  The  element-to-voxel  map 
enables  voxels  to  be  assigned  additional  data,  including  axonal 
strain  and  strain  rate  from  the  finite  element  simulation.  Alstott  et 
al.  [42]  chose  to  generate  brain  lesions  by  altering  the  structural 
connectivity  matrix  of  the  brain  by  deleting  nodes  using  various 
methods.  In  the  present  work,  instead  of  deleting  nodes,  we 
degrade  the  edges  of  the  network  based  on  the  computed  cellular 
death  at  the  voxel  level. 


To  understand  how  the  structural  network  is  degraded,  consider 
the  schematic  shown  in  Figure  2.  Edges  in  the  network  are 
constructed  from  voxels  that  connect  two  different  ROIs.  The 
edge  strength  is  relative  to  the  number  of  tracts  between  two  ROL 
Cellular  death  for  each  voxel  is  computed  using  Equations  1-3 
and  may  grow  to  reach  the  chosen  critical  value  of  cellular  death, 
Dc  (discussed  shortly).  Voxels  with  a  predicted  cellular  death 
greater  than  Dc  are  shown  in  red  in  Figure  2.  For  this  study,  if  a 
tract  traverses  a  voxel  that  is  greater  than  the  threshold,  the  entire 
tract  is  considered  damaged,  thus  the  edge  strength  is  decreased. 
This  procedure  is  similar  to  that  used  by  Kuceyeski  et  al.  [43]  who 
simulated  lesions  at  each  voxel  and  removed  tracts  passing  through 
the  lesion  in  order  to  create  a  damaged  network  to  enable  analysis 
of  changes  in  network  measures  and  the  evaluation  of  the 
importance  of  each  voxel.  Similar  to  Honey  et  al.  [60],  the 
connection  strengths  were  resampled  to  a  Gaussian  distribution 
with  a  mean  of  0.5  and  a  standard  deviation  of  0.1.  As  Alstott  et  al. 
point  out  [42] ,  this  transformation  does  not  alter  the  rank-ordering 
of  strong  to  weak  pathways,  but  simply  compresses  the  scale  of 
connection  strengths.  There  are  other  possible  ways  to  degrade  the 
structural  network  that  will  be  discussed  later. 

Since  Equations  1-3  predict  some  degree  of  cell  death  for  non- 
zero values  of  axonal  strain,  strain  rate,  and  time,  the  additional 
critical  value  of  cell  death,  Z)^,  was  required  as  a  rule  in  order  to 
degrade  a  voxel.  The  critical  cellular  death  of  Z)^  =  3%  was  chosen 


Underlying  Voxels  with 
Cell  Death  Information 


Figure  2.  Schematic  showing  how  structural  network  edges  are  degraded  over  time.  Red  voxels  indicate  that  the  chosen  critical  cellular 
death  threshold,  Dc,  is  reached.  For  this  study,  DTI  tracts  that  traverse  a  damaged  voxel  are  removed,  thus  degrading  the  connection  strength  of  the 
network  edge. 

doi:1 0.1 371/journal.pcbi.1 00261 9.g002 
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because  it  predicted  similar  levels  of  damage  as  compared  to  other 
proposed  thresholds,  including  18%  axonal  strain,  which  was  used 
previously  as  a  threshold  that  indicated  degradation  in  electro- 
physiological function  [30].  During  the  calculation  of  cellular 
death,  positive  values  of  strain  and  strain  rate  were  used  (negative 
strain  was  not  used  to  calculate  cellular  death).  The  choice  of 
injury  threshold,  as  well  as  the  micromechanics  of  axonal  fiber 
bundles,  is  an  active  area  of  research  that  would  assist  making  the 
current  methodology  more  accurate  in  the  future.  During  the 
15  ms  dynamic  simulation,  strain  and  strain  rate  data  for  each 
voxel  is  output  at  0.1ms  increments.  Cell  death  is  calculated  at 
each  voxel  for  each  of  these  increments.  Then,  the  maximum  cell 
death  value  calculated  in  each  voxel  is  used  to  predict  cell  death  up 
to  96  hrs.  It  is  important  to  note  that  cell  death  calculations  are  the 
result  of  the  combination  of  variables  at  each  increment  rather 
than  considering  each  variable  independently. 

Results 

Finite  Element  Analysis  of  Head  Impact 

The  deformed  configurations  of  the  head,  along  with  contours 
and  response  curves  for  various  locations  within  the  brain  (frontal, 
parietal,  occipital,  temporal,  corpus  callosum  and  cerebellum)  are 
shown  in  Figures  3a-f  Output  variables  including  pressure,  axonal 
strain,  and  effective  strain  rate,  useful  for  understanding  the 
anisotropic  biomechanical  response  for  white  matter,  are  shown. 
Confidence  in  the  finite  element  model  is  established  by 
comparing  output  from  the  computations  to  pre-existing  experi- 
mental data  on  cadaveric  head  impact  [53].  The  values  of  axonal 
strain  and  strain  rate  were  taken  from  specific  locations  in  each  of 
the  six  regions  plotted  in  Figure  3.  Four  out  of  six  of  the  locations 
represented  the  locations  of  the  pressure  transducers  for  the 
experiments  of  Nahum  et  al.  [53].  Locations  within  the  cerebellum 
and  corpus  callosum  were  also  added  for  the  strain  and  strain  rate 
analysis.  A  variation  of  results  will  exist  in  each  region,  and  taking 
an  average  of  values  in  a  anatomical  region  could  help  resolve  this 
problem,  but  including  simulation  data  points  that  are  located 
farther  from  the  experimental  sampling  points  could  also  make  the 
results  less  accurate.  The  head  impact  simulation  took  about 
30  hours  on  32  processors  in  order  to  reach  15  ms. 

As  seen  in  Figure  3b,  the  intracranial  pressure  quickly  increases 
to  positive  values  in  the  frontal  and  parietal  regions,  while  quickly 
increasing  to  negative  pressures  in  the  occipital  and  posterior  fossa 
regions.  The  anteroposterior  pressure  gradient  (seen  in  Figure  3a) 
is  commonly  observed  in  experimental  and  computational  studies  - 
giving  rise  to  the  so-called  coup  and  contrecoup  loading  scenario, 
for  which  there  are  an  associated  number  of  proposed  injury 
mechanisms.  Short  duration  intracranial  pressure  gradients  with 
high  positive  pressures  are  observed  at  the  coup  region,  with 
negative  pressures  at  the  contrecoup  region.  The  maximum 
positive  pressure  of  approximately  160kPa  is  reached  in  about 
2.5  ms  in  the  frontal  lobe,  closest  to  the  impact,  while  the 
maximum  negative  pressure  of  approximately  —  80  kPa  is  reached 
in  about  2.0  ms  in  the  posterior  fossa  region.  The  computed 
pressure  response  is  directly  compared  to  Nahum  et  al.'s  [53] 
experimental  results  in  Figure  3b  and  show  similar  trends.  In 
addition,  validation  of  the  strain  response  against  cadaveric 
experiments  of  Hardy  et  al.  [61]  is  also  described  in  the 
Supplemental  Text  S3. 

Axonal  strain  at  the  various  brain  regions  responds  slower  than 
the  pressure  response.  The  axonal  strain  begins  to  substantially 
grow  at  1  ms  and  shows  a  gradual  rate  of  change  of  strain,  with  a 
maximum  of  33%  at  12  ms  over  the  duration  of  the  simulation 
within  the  temporal  region.  Unlike  the  pressure,  an  obvious 


transcranial  gradient  is  not  apparent  for  the  axonal  strain.  While 
the  frontal  region  had  the  highest  predicted  pressure,  the  temporal 
and  occipital  brain  locations  have  the  largest  values  of  axonal 
strain  in  the  regions  specifically  examined.  Later,  in  the  structural 
network  analysis,  these  areas  are  associated  with  the  largest 
amount  of  cellular  death.  Interestingly,  if  a  threshold  of  injury  of 
18%  axonal  strain  is  chosen  [8,30],  our  results  show  the  onset  of 
injury  occurs  at  approximately  9.1  ms  within  the  temporal  lobe, 
despite  the  intracranial  pressure  reaching  approximately  100  kPa 
in  2  ms.  The  time  scales  at  which  the  pressure  and  strain  grow  will 
be  discussed  later  in  the  context  of  injury  cascades. 

The  effective  strain  rate,  also  commonly  referred  to  as  rate  of 
loading  or  loading  rate,  has  a  maximum  value  of  approximately 
85  s~^  in  the  temporal  lobe.  From  the  contours  shown  in  Figure  3e 
there  appears  to  be  a  strain  rate  focusing,  with  lower  strain  rates 
closer  to  the  skull  and  higher  values  more  central  to  the  brain. 
Shear  strain  focusing  has  been  reported  earlier  [3,62]  and  is 
attributed  to  the  partial  conversion  of  energy  of  the  axial  impact 
into  a  shear  mechanical  stress  wave  as  a  result  of  the  material 
response  of  the  brain,  cerebrospinal  fluid,  and  skull  [62].  The 
maximum  strain  rate  is  observed  before  maximum  axonal  strain 
because  the  axonal  strain  accounts  for  magnitudes  of  deformation, 
while  the  strain  rate  relates  to  the  rate  of  change  of  strain. 

FEM  Informed  Structural  Connectome  Analysis 

The  axonal  strain  and  strain  rate  output  from  the  finite  element 
simulations  are  used  to  compute  the  amount  of  cellular  death, 
according  to  Equations  1-3.  Figure  4  shows  the  evolution  of 
damaged  tractography  up  to  96  hrs  post-impact,  using  a  critical 
cellular  death  of  1)^  =  3%  as  a  threshold  for  injury  and  the 
corresponding  evolution  of  the  degraded  structural  network  for 
sagittal  and  transverse  views.  The  edges  in  the  network,  which  are 
fully  damaged,  are  shown  in  red  for  visualization.  In  reality,  each 
edge  is  weighted  and  has  degraded  values  before  it  is  fully 
damaged.  The  network  nodes  are  scaled  by  the  percentage  of 
connections  that  were  removed,  so  that  larger  nodes  have  lost 
more  connections  compared  to  original  values  (1  —  [DegreCnew 
/Degreeoriginal])-  The  sagittal  and  transverse  views  are  shown  to 
provide  insight  about  where  the  damage  is  predicted  and  how  it 
progresses  through  time.  At  24  hrs,  the  top  four  regions  affected 
include  the  cuneus  (medial  surface  of  left  cerebral  hemisphere  in 
the  occipital  lobe),  fusiform  gyrus  (temporal  lobe),  lingual  gyrus 
(occipital  lobe),  and  peri-calcarine.  Together,  damage  in  these 
brain  regions  have  559  tractography  fibers  removed  from  network 
edges.  Note  that  the  total  number  of  fiber  tracts  prior  to  impact 
was  497,442,  so  this  damage  corresponds  to  a  0.01 1  %  degradation 
of  tractography.  The  percentage  of  fully  degraded  edges  for  24,  48, 
72,  and  96  hrs  are  listed  in  Table  1.  Using  an  18%  strain  criterion, 
17.3%  of  the  edges  were  fully  degraded. 

Figure  4b  shows  that  structural  damage  occurs  first  in  left 
cuneus  and  the  right  superiotemporal  regions,  resulting  in  one  fully 
damaged  edge  at  24  hrs.  The  edge  between  the  two  regions  had 
only  three  tract  fibers  with  less  than  one  voxel  or  2  mm  between 
them.  The  edge  also  had  a  high  mean  fiber  length  of  131.7mm 
(the  original  total  mean  fiber  length  was  81.7  mm).  This  suggests 
that  network  edges  associated  with  few  tract  fibers  and  long  fiber 
length  are  more  susceptible  to  damage.  Asymmetry  of  the 
predicted  damage  occurs  due  to  the  asymmetry  of  the  underlying 
anatomy  associated  with  the  finite  element  mesh,  as  well  as 
potential  asymmetry  of  the  fiber  tractography,  which  has  been 
studied  in  recent  work  [63].  At  48  hrs,  predicted  damage  seems  to 
show  a  high  density  of  damaged  fibers  in  the  anteroposterior 
direction  in  both  hemispheres  of  the  brain.  Lateral  tractography 
damage  is  also  predicted  within  the  corpus  coUusum.  As  time 
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Figure  3.  Orthographic  view  of  the  local  three-dimensional  response  measured  at  various  locations  in  the  brain  for  (a-b)  pressure, 
(c-d)  axonal  strain  and  (e-f)  strain  rate  predicted  using  a  finite  element  simulation  for  impact  to  the  head.  Prediction  of  tlie  intracranial 
pressure  response  is  compared  to  cadaveric  experiments  [53]  and  is  sliown  in  (b). 
doi:1 0.1 371/journal.pcbi.1 00261 9.g003 


progresses,  fully  damaged  rostrocaudal  tracts  are  predicted  and  goes  through  a  voxel  that  has  reached  the  critical  cellular  death 

continue  to  increase.  value,  Dc,  the  tract  is  removed.  Therefore,  it  is  useful  to  examine 

Structural  changes  to  the  tractography  and  resulting  network  how  the  inherent  voxel  properties  influence  results.  Figure  5a 

arise  because  of  the  underlying  voxel  condition.  That  is,  if  a  tract  shows  the  distribution  of  voxels  above  the  predicted  critical  cellular 
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Sagittal  View  Transverse  View 


Figure  4.  Evolution  of  damaged  tracts  and  the  corresponding  structural  networks.  Using  empiricdlly-based  cellular  death  predictions 
obtained  from  in  vitro  models  of  neural  tissues,  local  strain  and  strain  rate  values  computed  from  finite  element  simulations  are  used  to  specify  injury. 
A  computed  cellular  death  of  3%  was  used  as  a  critical  value  for  defining  white  matter  disruption.  Damage  is  shown  in  red  and  the  node  size 
represents  the  percent  change  of  degree.  The  predicted  evolution  of  damage  is  shown  for  the  sagittal  and  corresponding  transverse  views  for  24  (a 
and  b),  48  (c  and  d),  72  (e  and  f),  and  96  hours  (g  and  h). 
doi:1 0.1 371/journal.pcbi.1 00261 9.g004 


death  of  3%  as  a  function  of  angle  between  the  axonal  fiber  bundle 
direction  and  the  direction  of  the  head  impact  (i.e.,  frontal  impact 
on  the  anteroposterior  axis).  If  the  tract  fiber  direction  within  a 


Table  1.  The  percentage  of  fully  degraded  edges  and 
percentage  of  voxels  above  the  3%  threshold  for  24,  48,  72, 
and  96  hrs  post-injury. 


Post-Injury  (hrs) 

%  Fully  Damaged 

%  Voxels  Above 

Edges 

3%  Threshold 

24 

0.097 

0.008 

48 

7.19 

1.1 

72 

14.1 

2.7 

96 

19.7 

4.2 

doi:1 0.1 371  /journal.pcbi.1 00261 9.t001 


voxel  is  parallel  with  the  impact  direction,  the  angle  is  zero.  This 
shows  the  distribution  of  damage  as  a  function  of  angle  with  respect 
to  the  impact  direction.  Because  of  the  high  degree  of  mechanical 
rotation  observed  through  the  shear  focusing  in  the  temporal  and 
occipital  brain  regions,  tract  fibers  with  large  angles  with  respect  to 
the  impact  direction  are  damaged  initially.  This  is  seen  in  Figure  5a 
at  24  hrs  (see  red  bars  in  plot)  and  in  Figure  4a-b. 

This  data  can  be  further  analyzed  by  normalizing  the  number  of 
damaged  voxels  for  a  given  orientation  by  the  total  number  of 
undamaged  voxels  with  a  given  orientation  from  the  impact 
direction.  This  measure  is  referred  to  as  the  angle-normalized 
number  of  damaged  voxels.  The  distribution  of  angle-normalized 
number  of  voxels  is  shown  in  Figure  5b,  and  is  important  because 
it  takes  into  account  the  original  distribution  of  the  number  of  tract 
fibers  eligible  to  be  damaged  with  respect  to  the  impact  direction. 
For  example.  Figure  5a  shows  18  voxels  above  the  3%  cell  death 
threshold  at  48  hrs;  thus,  one  might  assume  that  axonal  bundles 
oriented  0—  10°  from  the  impact  direction  have  little  importance. 
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Figure  5.  Distribution  of  (a)  the  number  of  voxels  with  predicted  cellular  death  above  3%  for  each  day  as  a  function  of  angle 
between  axonal  fiber  bundle  direction  and  direction  of  impact  to  the  head,  and  (b)  the  number  of  voxels  with  predicted  cellular 
death  above  3%  for  each  day  as  a  function  of  angle  between  axonal  fiber  bundle  direction  and  direction  of  impact  to  the  head 
normalized  by  the  initial  distribution  of  fiber  angle  with  respect  to  the  loading  condition. 
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However,  when  the  original  number  of  voxels  available  to  be 
damaged  is  taken  into  consideration  (Figure  5b),  11—20°  angles 
have  the  highest  percentage  of  damage  at  48  hrs.  In  other  words, 
depending  on  the  underlying  white  matter  microstructure,  i.e.,  the 
axonal  bundle  orientation,  with  respect  to  the  direction  of  impact, 
the  node,  and  edge  degradation  in  the  structural  network  may  be 
affected  differently.  This  is  directly  related  to  the  structural 
mechanics  (as  opposed  to  structural  connectomics)  of  the 
underlying  constitutive  or  material  law  used  within  the  finite 
element  simulation. 

However,  because  a  monotonic  function  is  used  to  describe  the 
empirical  cellular  death  prediction,  this  trend  becomes  more  dilute 
as  time  progresses  (but  should  not  be  extrapolated  past  96  hrs 
since  Equations  1-3  are  not  validated  beyond  that  time).  From 
Figure  4g-h,  at  96  hours  the  predicted  damaged  axonal  pathways 
can  be  seen  in  many  directions  and  across  all  white  matter  brain 
regions  indicating  diffuse  structural  degradation.  To  summarize,  at 
24  hrs  fibers  in  the  areas  of  large  rotational  tissue  strain  are 
susceptible,  while  at  greater  times,  neuronal  tracts  that  align  with 
the  direction  of  impact  seem  more  susceptible  to  damage  (using 
the  assumed  threshold  of  cellular  death).  In  the  future,  additional 
empirical  cellular  death  predictions  that  are  non-monotonic  (if 
valid),  or  cellular  regeneration  and  repair  models,  would  be  useful 
to  explore. 

Figure  6a-e  shows  the  evolution  of  the  structural  connectivity 
strength  matrices  predicted  as  a  result  of  head  impact  simulations. 
The  original  structural  connectivity  strength  is  defined  as  a 
Gaussian  distribution,  created  by  using  a  mean  of  0.5  and 
standard  deviation  of  0.14  distributed  over  the  total  number  of 
edges  in  the  network.  Each  figure  represents  a  snapshot  in  time 
starting  with  t  =  0  and  ending  at  96  hours.  Within  the  96  hr  period 
(for  which  the  monotonic  cellular  death  criteria  are  validated),  as 
long  as  regions  have  non-zero  strain  and  strain  rate  computed 


from  the  finite  element  simulation,  edges  in  the  network  become 
degraded.  This  decline  in  network  strength  is  evident  in  Figure  6e. 
Since  a  3%  critical  cellular  death  value  is  only  one  possible  injury 
criteria  that  could  be  chosen,  an  additional  criterion  was 
examined.  Results  using  two  different  damage  thresholds  are 
shown  in  Figure  6e  and  f  Figure  6e  is  the  connection  matrix  using 
the  3%  threshold  at  96  hrs,  while  Figure  6f  is  connection  matrix 
using  the  18%  axonal  strain  threshold  at  /=15ms,  as  used  in 
previous  work  [8,31].  Connection  strengths  show  qualitatively 
similar  trends  in  magnitude  and  location  of  edge  degradation. 
While  the  two  criteria  offer  a  similar  prediction  of  network 
damage,  the  cellular  death  Equations  1-3  are,  perhaps,  more 
useful  because  effects  of  strain,  strain  rate,  and  time  have  been 
decomposed  into  separate  multiplicative  terms  that  allow  a 
compartmentalized  study  of  extrinsic  biomechanical  conditions. 

In  general,  a  network's  global  efiiciency  represents  how  well- 
connected  the  network  is  compared  to  a  perfectly  connected 
network  [64]  and  captures  the  network's  capacity  for  communi- 
cation along  short  paths  [65].  Herein,  values  are  reported  as 
normalized  global  efficiency,  which  is  the  global  efficiency  of  the 
network  divided  by  the  efficiency  of  an  ideal  network.  Ideal 
network  efficiency  is  calculated  as  a  network  where  all  nodes  are 
connected  at  the  minimum  cost.  The  cost  for  each  edge  is  defined 
as  unity  minus  Gaussian  strength.  While  strength  offers  a  measure 
of  the  capacity  to  send  information,  the  cost  indicates  the 
resistance  to  sending  information  -  connections  with  high  strength 
are  low  in  cost.  Local  efficiency  is  a  network  measure  that  Latora 
[65]  suggests  helps  to  reveal  the  fault  tolerance  of  the  network 
system  and  shows  how  efficient  communication  between  first  node 
neighbors  is.  For  this  study,  local  efficiency  is  calculated  for  the 
same  set  of  nodes  used  to  make  the  local  network  of  the 
undamaged  network,  even  if  a  node  loses  its  edge  to  the  primary 
node.  This  method  is  used  to  provide  a  more  direct  comparison  of 
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Figure  6.  Connection  strength  matrices  showing  degradation  over  time,  t.  Connection  strengths  were  resampled  to  a  Gaussian  distribution 
witli  a  mean  of  0.5  and  a  standard  deviation  of  0.1.  Because  of  tlie  monotonic  cellular  death  criterion,  as  long  as  regions  have  non-zero  axonal  strain 
and  strain  rate  from  the  finite  element  simulation,  edges  in  the  network  eventually  become  degraded.  The  connection  strength  matrices  at  /  =  0  hrs  is 
shown  in  (a).  The  evolution  of  connection  strength  matrices  for  0  (a),  24  (b),  48  (c),  72  (d),  and  96  hours  (e)  are  shown,  as  well  as  (f)  the  connection 
strength  matrix  for  the  case  when  a  18%  strain  threshold  is  used. 
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the  network  after  damage  by  accounting  for  edges  that  were 
removed.  As  a  result,  edges  can  no  longer  be  part  of  a  short  path 
between  nodes  and  cannot  contribute  to  the  efiiciency,  while  also 
taking  into  account  all  nodes  that  should  be  a  part  of  the  local 
network  without  damage. 

The  normalized  global  and  mean  local  efficiencies  as  a  function 
of  time  are  shown  in  Figure  7.  The  normalized  global  efficiency  is 
approximately  0.14  at  /  =  Ohrs  and  is  about  the  same  at  24  hrs, 
indicating  the  network  remains  capable  to  send  information. 
However,  at  48  hrs  an  8.8%  reduction  in  normalized  global 
efficiency  is  predicted  and  continues  to  reduce  with  time.  A  similar 
trend  is  observed  in  the  mean  local  efficiency.  At  96  hrs  the 
normalized  global  efficiency  was  reduced  24%,  while  the  mean 
local  efficiency  was  reduced  27%.  By  96  hrs,  all  but  sixteen  nodes 
had  greater  than  20%  reduction  of  local  efficiency.  Also,  note  that 
there  were  no  nodes  completely  disconnected  from  the  network, 
although  there  were  edges  completely  removed.  The  total  number 
of  edges  at  /  =  Ohrs  was  1029;  at  /  =  96hrs,  203  edges  were 
removed  using  the  cellular  death  threshold.  Using  an  injury 
threshold  of  18%  strain,  161  edges  were  removed. 


Watts  and  Strogatz  [66]  define  the  small-world  network  based  on 
the  clustering  coefficient  of  the  network  and  the  characteristic  path 
length  of  the  network.  The  clustering  coefficient,  C,  is  the  fraction  of 
triangles  around  a  node  and  is  weighted  by  the  geometric  mean  of 
weights  associated  with  edges  of  a  triangle  [64] .  It  measures  how 
well  the  first  neighbors  of  a  node  are  connected  to  each  other.  The 
characteristic  path  length,  L,  is  the  average  shortest  path  between  all 
nodes  of  the  network.  To  qualify  as  small-world,  the  networks 
clustering  coefficient  should  be  greater  than  that  of  an  equivalent 
random  network,  whereas  the  characteristic  path  length  should  be 
approximately  equal  [66] .  The  small-world  coefficient  is  defined  as 
(C/Crand)/(V^rand)  [64].  This  valuc  was  found  to  be  1.87,  1.87, 
2.03,  2.31,  and  2.64  for  t  =  0,  24,  48,  72,  and  96  hrs,  respectively. 
When  using  axonal  strain  as  the  threshold  variable,  a  value  of  2.38 
was  computed.  The  increase  seen  in  the  small-world  coefficient  is 
due  to  the  more  rapid  decrease  in  clustering  coefficient  of  the 
random  network  compared  to  the  damaged  network  that  it  is  based 
on.  In  addition,  the  increase  seen  in  the  small-world  coefficient  also 
shows  the  ability  of  the  network  to  maintain  its  modular  structure 
more  effectively  than  the  random  network  that  it  was  compared  to. 
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Figure  7.  Global  and  local  efficiency  in  terms  of  time  from  impact. 
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The  percent  reduction  of  local  efficiencies  and  the  associated 
betweenness  for  the  top  1 0  regions  affected  by  impact  at  96  hrs  are 
listed  in  Table  2.  Betweenness  is  defined  as  the  number  of  times  a 
node  is  part  of  a  shortest  path  between  nodes,  and  is  one  indication 
of  a  node's  network  centrality  that  help  to  describe  the 
"importance"  of  a  node  [64,65].  Hubs  within  the  connectome 
are  identified  using  betweenness  centrality.  The  maximum 
betweenness  of  the  original  network  at  t  =  0  is  226,  while  the 
minimum  value  is  zero.  Recall  from  Figures  4a-b  and  5a,  white 
matter  disruption  is  observed  in  the  left  cuneus  and  the  right 
superiotemporal  regions,  which  has  a  betweenness  centrality  of  41 
and  17,  respectively.  The  moderate  to  low  betweenness  of  the 
regions  first  affected  at  24  hrs  helps  to  explain  why  the  normalized 
global  efficiency  was  not  significantly  reduced  during  this  time  and 
shows  the  importance  of  a  robust  brain  network.  At  96  hrs,  the 
right  hemisphere  lateral  orbitofrontal  region  shows  the  largest 
degradation  in  local  efficiency  of  approximately  45.4%;  however, 
that  region  also  has  a  zero  betweenness  centrality.  On  the  other 
hand,  the  right  hemisphere  medial  orbitofrontal  region  has  a 
betweenness  of  110  and  shows  a  37.4%  reduction  of  local 
efficiency,  indicating  it  may  have  a  more  significant  effect  to  the 
brain  network.  At  96  hrs,  there  are  10  network  nodes  with 
betweenness  greater  than  100  that  have  at  least  a  20%  reduction  of 
local  efficiency.  Local  efficiency  and  betweenness  for  all  ROIs  are 
included  in  Supplemental  Figures  S2  and  S3,  respectively. 

The  results  reported  thus  far  are  based  on  a  critical  cell  death 
value  of  Z)c  =  3%,  as  well  as  a  critical  strain  threshold  of  18%. 
However,  it  is  also  useful  to  evaluate  the  sensitivity  of  the  results  to 
the  choice  of  Z)^-  An  analysis  of  network  properties  was  performed 
for  multiple  critical  cell  death  values,  in  the  range  from 
Dc  =  2— lOVo,  and  are  shown  in  Figure  8a-b.  Figure  8a  shows 
the  percent  reduction  in  total  edge  strength  as  a  function  of  the 


critical  cell  death  threshold  choice.  Figure  8b  shows  the  percent 
reduction  in  global  efficiency  as  a  function  of  the  critical  cell  death 
threshold  choice.  Results  show  that  the  choice  of  the  critical 
threshold  significantly  affects  both  network  measures  that  were 
examined.  For  example,  for  a  change  of  between  3%  or  4%, 
the  reduction  in  global  efficiency  is  23.9%  and  16.4%,  respec- 
tively. In  other  words,  from  only  a  1  %  change  in  the  choice  of 
results  in  a  7.5%  change  in  global  efficiency  reduction.  Therefore, 


Table  2.  Structural  measures  of  simulated  lesions  for  the  top 
ten  regions  ranked  according  to  percent  reduction  in  local 
efficiency. 

Brain  Region 

%  Reduction 

Betweenness 

Local  Eff.  at  96  hrs 

at  0  hrs 

Lateral  Orbitofrontal  (RH) 

45.4 

0 

Parahippocampal  (RH) 

44.2 

0 

Parsorbitalis  (RH) 

44.1 

0 

Transverse  Temporal  (RH) 

42.0 

0 

Pericalcarine  (RH) 

39.4 

1 

Parstriangularis  (RH) 

39.3 

2 

Temporal  Pole  (RH) 

39.1 

3 

Rostral  Anterior  Cingulate  (RH) 

38.4 

0 

Frontal  Pole  (RH) 

38.1 

12 

Superior  Temporal  (RH) 

37.9 

21 

RH  and  LH  refer  to  right  and  left  hemispheres,  respectively. 
doi:1 0.1 371/journal.pcbi.1 00261 9.t002 
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the  choice  of  Dc  will  be  important  in  obtaining  accurate  results 
and  highlights  the  need  for  future  experimentation  to  characterize 
cellular  injury  criteria  for  all  areas  of  the  brain  in  order  to  improve 
the  accuracy  of  this  modeling  approach. 

Discussion 

For  the  first  time,  a  physics-based  model  has  been  linked  to  a 
network-based  analysis  that  establishes  a  coupled  computational 
method  to  study  the  effects  of  localized  structural  damage  or 
lesions.  In  the  presented  work,  lesions  are  simulated  by  using  a 
state-of-the-art  finite  element  model  of  the  human  head,  developed 
for  a  single  individual  directly  from  MRI,  impacted  on  the 
forehead  region.  The  local  finite  element  variables  are  coupled  to  a 
network-based  analysis  through  an  empirically  based  cellular 
injury  model.  The  new  approach  attempts  to  capture  the 
spatiotemporal  structural  characteristics  of  brain  trauma.  Foun- 
dations of  this  idea  stem  from  previous  studies  that  attempted  to 
establish  relationships  of  lesion  sites  and  the  resulting  functional 
impact  [42,67]. 

The  first  part  of  our  study  involved  developing  a  new  three- 
dimensional  human  head  finite  element  model  for  simulating  the 
biomechanical  response  from  frontal  impact  and  comparing  it  to 
experimental  data  obtained  from  cadavers.  The  finite  element 
model  is  unique  in  that  it  uses  diffusion  tensor  imaging 
tractography  to  inform  structural  mechanics  constitutive  laws  of 
the  underlying  white  matter  anisotropy  in  an  effort  to  help 
elucidate  the  injury  mechanisms  of  impact  neurotrauma.  Simula- 
tions of  frontal  impact  capture  the  coup  and  contrecoup  loading 
profiles,  including  short  duration  intracranial  pressure  gradients 
with  high  positive  pressures  at  the  coup  region,  and  negative 
pressures  at  the  contrecoup  region  (Figure  3a-b).  The  computed 
pressure  compares  well  to  past  experimental  results  [53].  While  the 
frontal  region  had  the  highest  predicted  pressure,  the  temporal 
and  occipital  brain  locations  had  the  largest  values  of  axonal  strain 
in  the  regions  specifically  examined  and  responded  slower  than  the 
pressure  response.  Because  of  varied  mechanical  tissue  properties 


and  constraints  with  the  head,  translational  cranial  motion  causes 
relative  brain  movements  that  happens  after  peak  pressure,  and 
leads  to  large  brain  deformations  and  significant  localized  regions 
of  axonal  strain  (Figure  3e-f). 

Empirically  based  cellular  death  thresholds  were  used  to  predict 
the  time-evolving  damage  in  various  brain  regions  based  on  finite 
element-based  predictions  of  local  axonal  strain  and  strain  rate. 
The  biomechanical  simulations  predict  that  the  temporal  and 
occipital  regions  undergo  the  most  axonal  strain  and  strain  rate  at 
short  times  (less  than  24  hrs),  which  leads  to  cellular  death 
initiation  that  produces  damage  which  shows  dependence  on  angle 
of  impact  and  underlying  microstructure  of  brain  tissue.  The 
cellular  death  model  that  was  used  in  this  study  is  based  on 
experimental  observations  that  cell  death  was  not  immediate  in 
response  to  deformation,  but  instead  increased  over  four  days  after 
insult.  Tissue  damage  becomes  more  dilute  as  time  progresses 
(Figure  4g-h).  At  96  hours,  the  predicted  damaged  axonal 
pathways  can  be  seen  in  many  directions  and  across  all  white 
matter  brain  regions,  indicating  diffuse  structural  degradation. 

Interestingly,  when  using  injury  criteria  proposed  in  the  past, 
including  thresholds  of  axonal  strain  [8,30]  or  intracranial  pressure 
[21],  the  finite  element  simulations  predict  injury  within  10  ms  of 
frontal  impact.  However,  when  using  a  cellular  death  criteria, 
damage  takes  longer  to  develop  but  offers  a  similar  result  to  the 
resulting  network  strength  (Figure  6).  Thus,  while  the  model  may 
be  limited  in  the  most  accurate  description  of  cellular  injury,  it 
does  seem  to  capture  the  underlying  pathology  of  diffuse  brain 
injury,  i.e.,  widespread  damage  to  axons  in  the  brain,  to  some 
degree.  While  the  two  criteria  offer  similar  prediction  of  network 
damage,  cellular  death,  is  perhaps,  more  useful  because  effects  of 
strain,  strain  rate,  and  time  have  been  decomposed  into  separate 
multiplicative  terms  that  allow  a  compartmentalized  study  of 
extrinsic  biomechanical  conditions.  On  the  other  hand,  as 
Morrison  et  al.  [27]  point  out,  purely  mechanical  definitions  of 
injury  thresholds  may  be  too  insensitive  to  identify  the  onset  of 
injury  for  the  brain,  since  biological  tissues  are  alive  and  perform 
some  form  of  active,  physiological  functions.  For  brain  tissue. 


Critical  Cell  Death  Threshold  (%)  Critical  Cell  Death  Threshold  (%) 


Figure  8.  Changes  in  (a)  total  edge  strength  and  (b)  global  efficiency  with  change  in  critical  cell  death  threshold  for  24,  48,  72,  and 
96  hours. 
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failure  can  be  defined  in  various  ways  and  may  occur  far  below 
mechanical  failure  limits.  Therefore,  additional  tolerance  criterion 
that  capture  degradation  of  the  electrophysiological  function  may 
be  also  required  for  brain  tissue,  since  injury  mechanisms  that  may 
alter  neuronal  function  without  requiring  cell  death  are  observed 
and  may  become  activated  more  quickly  than  cell  death  [68]. 

This  work  has  attempted  to  establish  a  physics-based  method- 
ology to  inform  structural  connectome  analysis.  In  the  current 
model,  network  edges  are  degraded  by  weight  rather  than  simply 
deleting  nodes,  in  an  attempt  to  include  the  effects  of  damage  on 
white  matter  "fibers  of  passage"  that  Alstott  et  al.  [42]  refer  to.  It  is 
assumed  that  tracks  representing  bundles  of  axons  are  able  to  be 
disconnected  when  passing  through  regions  of  high  cell  death, 
simulating  reduction  in  the  strength  of  a  structural  connection,  and 
are  related  to  a  decrease  in  localized  white  matter  integrity.  All 
network  damage  was  linked  directly  to  predicted  tissue  deforma- 
tion and  predicted  cellular  death.  Interestingly,  at  96  hrs  post- 
impact,  the  methods  used  here  did  not  lead  to  any  node 
completely  disconnected  from  the  network  (although  there  were 
edges  completely  disconnected).  At  early  times  (^<  24  hrs)  network 
measures  of  global  and  local  efficiency  were  degraded  little, 
however,  as  time  increased  to  96  hrs  the  network  properties  were 
significantly  reduced  (Figures  6  and  7).  Alstott  et  al.  [42]  found  that 
random  removal  of  nodes  did  not  affect  network  integrity  until 
almost  all  of  the  nodes  were  deleted.  Thus,  this  method  may 
capture  some  structural  network  features  of  very  mild  neuro- 
trauma  at  early  times,  but  would  benefit  from  a  functional  network 
analysis  to  explore  the  potential  outcomes.  As  the  network  was 
damaged  in  certain  areas,  edges  were  lost  or  degraded,  raising  the 
cost  to  send  information  between  nodes  and  producing  different 
short  paths  between  nodes.  As  a  result,  the  brain  network  was  not 
able  to  maintain  efficiency  by  using  alternate  paths  or  finding 
strong  hub  connections  past  24  hrs  post-impact,  thus  demonstrat- 
ing a  potential  limitation  of  the  brain  to  retain  network  robustness 
in  extreme  conditions  that  cause  neurotrauma. 

While  global  efficiency  is  able  to  demonstrate  widespread  effects 
of  damage,  local  efficiency  provided  a  measure  to  investigate 
localized  damage  within  the  network  related  to  areas  of 
concentrated  axonal  strain  and  strain  rate  in  particular  areas  of 
the  brain.  There  was  a  much  larger  reduction  in  local  efficiency  at 
areas  of  high  cell  damage  compared  to  reduction  in  the 
normalized  global  efficiency,  indicating  that  the  brain  as  a  whole 
is  resistant  to  some  degree  of  localized  damage.  Brain  regions  that 
experienced  the  largest  cellular  death  showed  a  larger  reduction  in 
local  efficiency  compared  to  the  global  efficiency  of  the  network 
(Table  2).  However,  the  low  betweenness  of  many  of  these  ROI 
suggest  that  they  were  not  as  necessary  for  communication  outside 
their  local  area.  This  suggests  that  the  modular  nature  of  the 
network,  including  its  small  world  properties,  helped  to  prevent 
loss  of  efficiency  on  the  global  scale  from  damage  at  the  local  scale. 
This  is  interesting  because  the  network  considered  here  is  a  rigid, 
static,  anatomic  network  in  which  there  is  no  adaptability  built  into 
the  model.  However,  it  seems  that  damaged  structural  network 
hubs  retain  the  ability  for  long  distance  communications.  In 
addition,  the  increase  predicted  in  the  small-world  coefficient  also 
shows  that  modularity  was  not  as  affected  by  the  damaged  static 
network,  as  compared  to  a  random  network. 

Limitations  and  Future  Work 

There  are  exciting  possibilities  for  future  work,  as  well  as 
limitations  to  the  current  modeling  approach.  The  current  model 
did  not  attempt  to  model  the  coupled  effects  to  the  functional 
network;  instead,  we  provided  an  example  for  a  single  individual  in 
order  to  establish  a  methodology  to  link  physics-based  predictions 


of  tissue  damage  with  structural  network  analysis  for  frontal 
impact  neurotrauma.  It  is  important  to  note  that  the  empirical 
relationships  may  only  be  accurate  for  the  rat  (not  the  human),  and 
most  likely,  there  are  many  more  regions  that  need  to  be 
characterized.  Our  results  and  conclusions  may  be  altered 
according  to  these  injury  thresholds.  Although  human  injury 
thresholds  are  currently  limited,  as  additional  brain  region  injury 
thresholds  are  experimentally  characterized  and  improved  they 
can  be  included  in  the  future.  Due  to  the  computational  cost  of  the 
finite  element  simulations,  the  brain  was  only  segmented  into  83 
different  regions.  In  the  future,  increased  segmentation  of  regions 
of  interest  and  improved  biofidelity  of  the  finite  element  model 
would  increase  the  resolution  of  the  analysis.  While  this  study  does 
not  address  the  resulting  functional  outcome  from  structural 
degradation,  coupled  structure-function  relationships  as  a  result  of 
neurotrauma  would  be  interesting  to  explore.  For  example,  a 
coupled  analysis  may  enable  functional  stimulus  that  may  prohibit 
or  enhance  further  cell  death.  For  the  prediction  of  tissue  damage, 
additional  physics,  such  as  electrochemical  reactions,  may  be 
useful  by  incorporating  diffusional  properties.  In  addition, 
increased  resolution  of  the  biomechanical  response  may  also  be 
improved  by  further  developing  white  matter  material  response 
descriptions  that  use  multiple  fiber  tract  orientations  within  a 
single  element,  thereby  enabling  the  capability  to  use  diffusion 
spectrum  imaging.  There  is  also  an  opportunity  to  use  this 
framework  to  explore  additional  injury  mechanisms  or  thresholds 
from  empirical  or  experimental  data,  such  as  intracranial  pressure. 
Note  that  the  current  methodology  degrades  network  edges, 
instead  of  nodes.  In  the  future,  it  may  be  useful  to  investigate 
methods  for  degrading  nodes  in  addition  to  edges  because  the  all 
of  the  nodes  represent  cortical  gray  matter,  and  the  gray  matter 
does  experience  significant  strains.  This  would  require  a  choice  of 
strain  measurement  other  than  axonal  strain  since  the  gray  matter 
is  treated  as  isotropic.  With  this  type  of  criteria,  it  may  be  possible 
to  degrade  a  node  based  on  a  ratio  of  damaged  voxels  within  a 
region  of  interest  compared  to  the  voxel  volume  of  the  region  as  a 
whole. 

Further  work  should  also  work  to  validate  this  approach  in 
humans.  There  are  at  least  two  distinct  areas  associated  with 
connectome  damage  that  should  be  explored  in  order  to  validate 
the  approach  described  herein.  The  first  deals  with  how  well  the 
location  of  damage  within  the  network  description  is  captured 
using  physics-based  predictions.  The  second  validation  strategy 
should  address  the  nature  in  which  edges  and  nodes  in  the  network 
are  degraded.  There  are  various  approaches  that  may  be  useful  for 
addressing  both  areas.  For  example,  in  order  to  validate  how  well 
the  location  of  damage  is  captured,  further  understanding  about 
how  cellular  level  changes  effect  fractional  anisotropy,  that  results 
in  altered  fiber  tractography  would  be  useful  to  develop.  Some  of 
this  information  may  be  obtained  from  various  ongoing  studies 
that  are  examining  the  ability  of  DTI  to  diagnose  mTBI,  which 
could  also  be  extended  to  create  degraded  structural  connectomes. 
One  way  to  do  this  may  be  to  use  DTI  studies  pre-  and  post-injury 
from  typical  loading  conditions  that  cause  rotation-induced  diffuse 
axonal  injury.  By  providing  similar  loading  profiles  within  the 
simulation  and  comparing  computed  DTI  tractography  damage 
with  the  clinical  data  set,  maybe  results  can  be  compared.  Perhaps 
this  may  be  accomplished  using  sports-related  impact  injury,  such 
as  American  football,  where  many  helmets  have  sensors  built-in  to 
record  impact  loads. 

In  conclusion,  this  work  has  explored  "connectome  neuro- 
trauma mechanics"  by  using  physics-based  finite  element  simula- 
tions to  help  elucidate  injury  mechanisms  associated  with 
neurotrauma  by  using  various  cellular  injury  thresholds  to  define 
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tissue  damage,  and  established  a  coupled  computational  frame- 
work to  inform  structural  connectome  analysis. 

Supporting  Information 

Figure  SI    The  magnitude  of  peak  relative  displacement  between 

skull  and  brain  for  validation  of  finite  element  model. 

(EPS) 

Figure  S2    Percent  reduction  in  local  efficiency  at  96  hrs  for  all 

brain  regions. 

(TIF) 

Figure  S3    Percent  reduction  in  betweenness  at  96  hrs  for  all 

brain  regions. 

(TIF) 
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There  are  a  substantial  number  of  empirical  relations  that  began  with  the  identification 
of  a  pattern  in  data;  were  shown  to  have  a  terse  power-law  description;  were  interpreted 
using  existing  theory;  reached  the  level  of  "law"  and  given  a  name;  only  to  be  subse- 
quently fade  away  when  it  proved  impossible  to  connect  the  "law"  with  a  larger  body  of 
theory  and/or  data.  Various  forms  of  allometry  relations  (ARs)  have  followed  this  path. 
The  ARs  in  biology  are  nearly  two  hundred  years  old  and  those  in  ecology,  geophysics, 
physiology  and  other  areas  of  investigation  are  not  that  much  younger.  In  general  if  X 
is  a  measure  of  the  size  of  a  complex  host  network  and  y  is  a  property  of  a  complex 
subnetwork  embedded  within  the  host  network  a  theoretical  AR  exists  between  the  two 
when  Y  =  aX^ .  We  emphasize  that  the  reductionistic  models  of  AR  interpret  X  and 
Y  as  dynamic  variables,  albeit  the  ARs  themselves  are  explicitly  time  independent  even 
though  in  some  cases  the  parameter  values  change  over  time.  On  the  other  hand,  the 
phenomenological  models  of  AR  are  based  on  the  statistical  analysis  of  data  and  inter- 
pret X  and  Y  as  averages  to  yield  the  empirical  AR:  (Y)  =  a{X)^ .  Modern  explanations 
of  AR  begin  with  the  application  of  fractal  geometry  and  fractal  statistics  to  scaling  phe- 
nomena. The  detailed  application  of  fractal  geometry  to  the  explanation  of  theoretical 
ARs  in  living  networks  is  slightly  more  than  a  decade  old  and  although  well  received  it 
has  not  been  universally  accepted.  An  alternate  perspective  is  given  by  the  empirical  AR 
that  is  derived  using  linear  regression  analysis  of  fiuctuating  data  sets.  We  emphasize 
that  the  theoretical  and  empirical  ARs  are  not  the  same  and  review  theories  "explain- 
ing" AR  from  both  the  reductionist  and  statistical  fractal  perspectives.  The  probability 
calculus  is  used  to  systematically  incorporate  both  views  into  a  single  modeling  strategy. 
We  conclude  that  the  empirical  AR  is  entailed  by  the  scaling  behavior  of  the  probability 
density,  which  is  derived  using  the  probability  calculus. 

Keywords:  Allometry;  fractals;  physiology;  scaling;  statistical  analysis;  ontogenetic 
growth. 
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1.  Introduction 

At  the  turn  of  the  nineteenth  century  the  German  polymath  Gauss^^  and  the  Amer- 
ican mathematician  Adrian^  introduced  into  science  the  law  of  frequency  of  errors, 
the  French  physicist  Laplace^ proved  the  central  limit  theorem  and  the  French 
zoologist  Curvier^^  determined  that  the  brain  mass  of  a  mammal  increases  more 
slowly  than  does  its  total  body  mass  (TBM).  Cuvier's  observations  initiated  the 
field  of  Allometry  that  relates  the  size  of  an  organ  to  that  of  the  host  organism  with 
mass  serving  as  a  measure  of  size.  The  roads  of  statistical  analysis  and  observation 
(experiment)  converge  in  the  twenty-first  century  to  explain  the  origin  of  allometry 
relations  (ARs). 

Allometry,  literally  meaning  by  a  different  measure,  has  acquired  a  mathematical 
description  through  its  relations  along  with  a  number  of  theoretical  interpretations 
to  account  for  its  mathematical  form.  However  no  one  theory  has  been  universally 
accepted  as  successfully  explaining  ARs  in  their  many  guises  so  the  corresponding 
origins  remain  controversial.  Consequently,  in  reviewing  the  properties  of  allometry 
data  along  with  their  various  theoretical  explanations  we  herein  provide  a  glimpse 
as  to  those  origins. 

We  use  the  generic  term  network  in  our  narration  in  order  to  slip  smoothly  from 
ARs  in  physical  networks  with  identical  particles  and  van  de  Waal's  forces,  to  bio- 
logical networks  with  structured  elements  and  chemical  interactions,  to  geophysical 
networks  with  complex  tributaries  and  branching  architectures.  This  nomenclature 
also  enables  us  to  transition  from  arcane  historical  theory  to  a  modern  perspectives 
of  complex  networks.  The  mathematics  of  renormalization  group  (RG)  theory, ^^^'^^^ 
fractional  differential  equations, ^^^'^^^  fractional  stochastic  differential  equations^^^ 
and  transitioning  from  dynamic  variables  to  phase  space  variables  to  express  the 
probability  calculus  in  terms  of  fractional  diffusion  equations^^^'^^^'^^^  are  herein 
found  to  provide  insight  into  different  aspects  of  the  origins  of  ARs. 

Allometry  has  been  defined  as  the  study  of  body  size  and  its  consequences ''^'^^^ 
both  within  a  given  organism  and  between  species  in  a  given  taxon.  Gayon^^  re- 
viewed the  history  of  the  concept  of  allometry,  defined  as  the  study  of  body  size 
and  its  consequences ''^'^''^  within  a  given  organism  and  between  species  in  a  given 
taxon,  and  distinguished  between  four  different  forms:  (1)  ontogenetic  allometry, 
which  refers  to  relative  growth  in  individuals;  (2)  phylogenetic  allometry,  which 
refers  to  constant  differential  growth  ratios  in  lineages;  (3)  intraspecies  allometry, 
which  refers  to  adult  individuals  within  a  species;  (4)  interspecies  allometry,  which 
refers  to  the  same  kind  of  phenomenon  among  related  species.  The  theoretical 
entailment  of  static  from  dynamic  allometry  models  has  not  been  systematically 
studied,  although  there  has  been  some  recent  effort  in  that  direction. ^^'^^^  Herein 
we  review  the  use  of  phenomenological  dynamic  equations  from  physical  biology  to 
relate  the  dynamic  allometry  of  the  first  two  categories  to  their  static  counterparts 
in  the  last  two. 

Galileo^^  recognized  that  in  order  for  an  organism  or  physical  structure  to  retain 
a  constant  function  as  size  increases  requires  its  shape  (architecture)  and/or  the 
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materials  with  which  it  is  constructed  to  change.  This  simple  physical  picture  has 
biological  and  social  analogs  in  which  the  size  of  a  network  has  unavoidable  con- 
sequences. ^^^'^^^  We  focus  on  ARs  for  physiologic  phenomena,  but  we  do  indicate 
where  insight  might  also  be  gained  from  other  disciplines  as  well. 

1.1.  What  is  the  equation? 

Sir  Julian  Huxley^^;  grandson  of  the  Huxley  of  Darwin  evolution  fame,  brother  of 
the  novelist  Aldous  {Brave  New  World)  and  half-brother  of  the  biophysicist  Andrew 
(the  Hodgkin-Huxley  equations)  proposed  that  two  parts  of  the  same  organism 
have  proportional  rates  of  growth.  In  this  way  if  Y"  is  a  living  subnetwork  observable 
with  growth  rate  and  X  is  a  measure  of  the  size  of  a  living  host  network  with 
growth  rate  7  then  the  fractional  increase  in  the  two  is  denoted  according  to  Huxley 
by: 

dX/-iX  =  dY/W .  (1) 

This  equation  can  be  directly  integrated  to  obtain: 

y  =  aX\  (2) 

the  time-independent  AR  where  a  and  h  (=  ^/^)  are  empirically  determined.  The 
theoretical  AR  given  by  Eq.  (2)  considered  by  Huxley  is  the  basis  of  subsequent 
theoretical  discussions  in  such  excellent  books  as  Schmidt-Nielson^^^  and  Calder.^^ 
The  intraspecies  AR  relates  a  property  of  an  organism  within  a  species  to  its 
TBM.  The  interspecies  AR  relates  a  property  across  species  such  as  the  basal 
metabolic  rate  (BMR)  to  TBM.^^'^^^  These  two  allometry  groups  are  distinctly 
different  and  the  models  developed  to  determine  the  theoretical  forms  of  the  al- 
lometry coefficient  a  and  exponent  h  in  the  two  cases  are  quite  varied,  as  shown 
subsequently. 

Equation  (2)  looks  very  much  like  the  scaling  relations  that  have  become  so 
popular  in  the  study  of  complex  networks  over  the  last  decade. ^'^^'^^^'^^^'^^^  His- 
torically the  nonlinear  nature  of  Eq.  (2)  has  precluded  the  direct  fitting  of  the 
equation  to  data.  A  logarithmic  transformation  is  traditionally  made  and  a  linear 
regression  to  the  data  on  the  equation 

lnr  =  lna  +  61nX  (3) 

is  used  to  estimate  the  parameters  a  and  h.  In  Sec.  2  we  review  a  myriad  of  phe- 
nomena from  a  number  of  disciplines  in  which  ARs  have  been  brought  to  light.  In 
Sec.  3  we  discuss  the  fitting  of  ARs  to  data. 

All  complex  dynamical  networks  manifest  fluctuations,  either  due  to  intrinsic 
nonlinear  dynamics  producing  chaos^^^'^^^  or  due  to  coupling  of  the  network  to  an 
infinite  dimensional,  albeit  unknown  environment, or  both.  The  modeling  strate- 
gies adopted  to  explain  ARs  have  traditionally  taken  one  of  two  roads:  the  statistical 
approach  in  which  residual  analysis  is  used  to  understand  statistical  patterns  and 


119 


On  Allometry  Relations 

to  identify  the  causes  of  variation  in  the  AR;^^'^^^  or  the  reductionist  approach  to 
identify  mechanisms  that  explain  specific  values  of  the  allometry  parameters. ^^'^^^ 
We  find  that  neither  approach  separately  provides  a  complete  explanation  of  the 
variety  of  phenomena  described  by  ARs.  The  influence  of  the  environment,  whether 
inducing  fluctuations  in  a  reductionist  model,  or  producing  a  systematic  change  in  a 
statistical  model,  has  been  taken  into  account  in  multiple  studies7^'^^'^^^  In  Sec.  5 
we  take  the  road  of  the  probability  calculus  that  systematically  incorporates  both 
reductionistic  and  statistical  mechanism  into  the  phenomenological  explanation  of 
ARs.  This  calculus  enables  modelers  to  associate  characteristics  of  the  measured 
probability  density  function  {pdf)  with  specific  deterministic  mechanisms  and  with 
structural  properties  of  the  coupling  between  variables  and  fluctuations. ^^^'^''^ 


1.2.  Is  there  universality? 

In  physics  the  notion  of  universality  occurs  in  the  context  of  critical  phenomena, 
where  in  the  vicinity  of  a  phase  transition  a  network  ceases  to  have  a  characteristic 
scale  and  can  be  characterized  by  a  set  of  critical  exponents.  The  principle  of 
universality  claims  that  networks  that  undergo  a  phase  transition  can  be  grouped 
into  one  of  a  small  number  of  universality  classes.  Gisiger^^  discusses  universality  in 
the  context  of  invariance  in  biology;  specifically  addressing  the  role  of  1//  noise.  In 
the  context  of  AR  universality  might  imply  that  the  allometry  exponent  plays  the 
role  of  a  critical  exponent.  The  empirical  evidence  suggests  that  this  interpretation 
of  the  allometry  exponent  is  not  appropriate.  A  somewhat  less  stringent  definition 
of  universality  may  be  defensible;  one  in  which  the  details  of  the  interactions  are 
washed  out  but  the  form  of  the  AR  persists  with  the  allometry  exponent  in  a 
restricted  interval. 

In  biology  the  ARs  associate  functional  variables  with  measures  of  body  size, 
such  as  the  TBM  X  =  M  raised  to  a  noninteger  power  .  The  average  BMR  is  one 
such  functional  variable  Y  =  B  that  can  be  expressed  in  terms  of  TBM.  The  most 
prevalent  theories  of  metabolic  allometry  argue  for  either  h  =  2/3,  based  on  body 
cooling,  or  h  =  3/4,  based  on  energy  efficiency.  Selected  data  sets  have  been  used 
by  various  investigators  to  support  either  of  these  two  values.  However,  there  is  also 
strong  evidence  that  there  is  no  universal  value  of  h  that  is  satisfied  by  all  metabolic 
data.  In  fact,  Bokma^^  presents  a  large  amount  of  fish  data  to  demonstrate  that  no 
single  universal  value  of  the  allometry  exponent  h  exists.  This  argument  is  supported 
by  Glazier who  observes  that  the  isometric  (6  =  1)  metabolic  scaling  of  pelagic 
animals  is  an  evolutionarily  malleable  trait  that  responds  to  environmental  changes. 

On  the  other  hand.  West  and  Brown^^^  argue  that  living  networks  do  have  uni- 
versal scaling  laws.  Their  arguments  rest  on  patterns  observed  in  biological  and 
botanical  data  and  the  quantitative  theory  of  the  fractal  structure,  organization 
and  dynamics  of  the  branching  networks  in  living  systems.  The  theory  developed 
by  West  et  al^^^^  which  we  review  in  Sec.  4,  has  as  one  of  its  tenets  the  existence  of 
hierarchical  fractal-like  branching  networks  for  the  delivery  of  nutrients  resulting  in 
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b  =  3/4.  They  attribute  this  origin  of  metabohc  AR  to  evolution's  solution  to  the 
grand  challenge  of  how  highly  complex,  self-sustaining,  reproducing,  living  networks 
service  enormous  numbers  of  localized  microscopic  units  in  an  efficient  and  "demo- 
cratic" way.  Their  conclusion,  like  that  of  the  earlier  analysis  others, ^^^'^^^  was 
that  fractal  networks  have  an  evolutionary  advantage  over  those  that  scale  classi- 
cally, independently  of  what  the  networks  distribute  from  macroscopic  reservoirs 
to  microscopic  sites. 

Scaling  is  a  ubiquitous  property  of  large  complex  networks  indicating  that  the 
observables  simultaneously  fluctuate  over  many  time  and/or  space  scales.  In  the 
physical  sciences  such  phenomena  have  historically  been  categorized  as  1//  noise^^^ 
or  1/f  variability. Mandelbrot^^^  was  probably  the  first  to  recognize  the  wide- 
ranging  significance  of  this  1/f  variability  with  his  introduction  of  fractals  into  the 
scientist's  lexicon.  The  existence  of  ARs  has  been  closely  tied  to  fractal  geometry 
by  some  invest igators^^^  and  in  Sec.  5  we  show  that  the  origin  of  AR  may  reside  in 
fractal  statistics,  the  scaling  of  pdfs^  and  not  in  fractal  geometry. 

1.3.  Some  comments  on  fractals 

Mandelbrot^^^'^^^  identified  ARs  masquerading  under  a  variety  of  empirical  "laws" 
and  argued  that  they  were  a  consequence  of  complex  phenomena  not  having  char- 
acteristic scales.  Subsequent  interpretations  of  ARs  often  involve  fractals  and  so 
we  recall  some  fundamental  properties  of  fractals  that  are  subsequently  useful.  To 
begin  let  us  give  a  qualitative  definition  of  a  fractal: "A  fractal  is  a  shape  made 
of  parts  similar  to  the  whole  in  some  way." 

The  fractal  concept  arises  in  three  distinct,  but  related  guises;  geometry,  statis- 
tics and  dynamics.  Geometric  fractals  deal  with  the  self-similarity  of  complex  geo- 
metric forms.  A  fractal  object  examined  with  ever  increasing  magnification  reveals 
ever  greater  levels  of  detail;  detail  that  is  self-similar  in  character.  The  basic  math- 
ematical properties  of  geometric  fractals  and  their  myriad  of  applications  can  be 
found  in  a  number  of  excellent  books^^'^^'^^'^^^'^^^'^^^'^^^  and  is  not  repeated  here. 
We  merely  record  and  interpret  those  properties  that  may  be  needed  for  the  anal- 
yses of  ARs.  The  number  of  self-similar  objects  N  required  to  cover  an  object  of 
dimension  D  is  given  by  =  r~^,  where  r  is  the  size  of  the  "ruler".  In  this  way 
the  fractal  dimension  of  the  object  being  covered  can  be  mathematically  defined  as: 


in  the  limit  of  vanishing  r.  As  the  ruler  size  goes  to  zero  the  number  of  rulers 
necessary  to  cover  the  object  diverges  to  infinity  in  such  a  way  that  D  remains 
finite  for  self-similar  objects  and  this  dimension  is  not  necessarily  integer  valued. 

An  observable  Z{t)  is  scaling  if  for  a  positive  constant  c  it  satisfies  the  homo- 
geneity relation 


D  =  -  \nN/\nr 


(4) 


Z{ct)  =  c^Z{t) . 


(5) 
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Modifying  the  units  of  the  independent  variable  therefore  only  changes  the  overall 
observable  by  a  multiplicative  factor;  this  is  self- affinity.  Barenblatt^^  remarked 
that  such  scaling  laws  are  not  merely  special  cases  of  more  general  relations;  they 
never  appear  by  accident  and  they  always  reveal  self-similarity.  Note  that  scaling 
alone  is  not  sufficient  to  prove  that  a  function  is  fractal,  but  if  a  function  is  fractal 
it  does  scale. 

In  the  sequel  we  relax  the  distinction  between  self-affine  and  self-similar,  since 
self-similarity  has  been  extended  to  encompass  both  meanings  in  the  physics  liter- 
ature. Meakin^^^  asserts  that  in  homogeneous  scaling  relations  it  is  the  coefficient 
that  embodies  the  "real  physics"  behind  power-law  relations.  He  further  observed 
that  the  allometry  exponents  are  universal  in  many  homogeneous  scaling  phenom- 
ena and  the  allometry  coefficients  provide  the  only  means  to  control  physical  prop- 
erties and  behavior. 

Scale  invariance  or  scaling  requires  that  a  function  ^(Xi, . . . ,  Xn)  be  such  that 
scaling  each  of  the  N  variables  by  an  appropriate  choice  of  exponents  (ai, . . .  ^a^) 
always  recovers  the  same  function  ^{Xi^ . . . ,  Xat)  up  to  an  overall  constant: 

. . . ,  X^)  =  7^^(7-^X1, . . . ,  7""X^) .  (6) 

We  observe  that  Eq.  (2)  is  possibly  the  simplest  of  such  scaling  relations  between 
two  variables  such  that  they  satisfy  the  RG  relation 

X{jY)  =  ^'XiY) . 

The  lowest-order  solution  to  this  equation  is,  of  course,  given  by  Eq.  (2)  and  we 
provide  the  general  solution  subsequently.  Changes  in  the  host  network  X  (size) 
control  (regulate)  changes  in  the  subnetwork  Y  (property)  in  living  networks  and 
in  some  physical  networks  through  the  homogeneous  scaling  relation. 

Inhomogeneity  in  space  and  intermittency  in  time  are  the  hallmarks  of  fractal 
statistics  and  it  is  the  statistical  rather  than  the  geometrical  sameness  that  is  ev- 
ident at  increasing  levels  of  magnification.  In  geometrical  fractals  the  observable 
scales  from  one  level  to  the  next.  In  statistical  fractals  where  the  phase  space  vari- 
ables (z^t)  replaces  the  dynamic  variable  Z{t)  it  is  the  pdf  P{z^t)  that  satisfies  a 
scaling  relation: 

P{az,pt)  =  p-^P{z,t)-    /i  =  lna/ln/3,  (7) 

where  ji  =  2H  and  the  homogeneity  relation  is  interpreted  in  the  sense  of  the  pdf  in 
Eq.  (7).  Time  series  with  such  statistical  properties  are  found  in  multiple  disciplines 
including  finance, economics, neuroscience,^'^^^  geophysics, physiology^^^ 
and  general  complex  networks. A  complete  discussion  of  statisical  data  with  such 
scaling  behavior  is  given  by  Beran^^  in  terms  of  the  long-term  memory  captured 
by  the  scaling  exponent.  One  example  of  a  scaling  pdf  is  given  by: 


(8) 
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and  in  a  standard  diffusion  process  Z{t)  is  the  displacement  of  the  diffusing  particle 
from  its  initial  position  at  time  ji  =  1/2  and  the  functional  form  of  Fz{-)  is  a 
Gauss  distribution.  However,  for  general  complex  phenomena  there  is  a  broad  class 
of  distributions  for  which  the  functional  form  of  Fz[-)  is  not  Gaussian  and  the 
scaling  index  /i  7^  1/2,  see  Sec.  5  for  additional  discussion. 

Dynamic  fractals  do  not  directly  enter  our  discussion  of  ARs.  However  for  com- 
pleteness we  mention  that  in  a  dynamic  fractal  the  geometry  of  the  manifold  on 
which  the  dynamics  of  a  network  unfolds  is  fractal,  so  that  the  associated  chaotic 
time  series  is  also  fractal. 


2.  Empirical  Allometry 

In  this  section  we  review  areas  of  investigation  where  size  has  been  observed  to 
control  the  properties  of  a  phenomenon.  In  so  doing  we  catalog  a  number  of  phe- 
nomenological  relations  that  are  not  often  discussed  from  the  same  perspective. 
There  is  a  nontrivial  number  of  empirical  relations  that  began  as  the  identification 
of  a  pattern  in  data;  were  shown  to  have  a  terse  power-law  description;  were  in- 
terpreted using  existing  theory;  reached  the  level  of  "law"  and  given  a  name,  not 
always  after  the  discoverer;  only  to  subsequently  fade  away  when  it  proved  impos- 
sible to  connect  the  "law"  with  a  larger  body  of  theory  and/or  data.  An  example 
drawn  from  the  Notebooks  of  Leonardo  da  Vinci^^^  relates  the  diameter  of  a  parent 
limb  dQ  to  two  daughter  limbs  di  and  ^2  • 

=  + d^  (9) 

The  da  Vinci  scaling  relation  supplies  the  phenomenological  mechanism  necessary 
for  AR  to  emerge  in  a  number  of  disciplines,  as  we  subsequently  discuss. 

Nearly  five  hundred  years  after  de  Vinci  recorded  his  observations  Murray^^^ 
used  energy  minimization  to  derive  the  same  equation  with  the  theoretical  value 
a  =  3,  which  is  known  in  the  literature  as  Murray's  Law  or  the  Murray-Hess  Law. 
In  the  simplest  case  di  =  ^2  the  da  Vinci  scaling  relation  reduces  to  scaling  between 
sequential  generations  of  a  bifurcating  branching  network  having  daughter  branches 
of  equal  radii: 

4+1  =  2-i/"4  (10) 

resulting  in  an  exponential  reduction  in  branch  diameter  from  generation  to 
generation. 


2.1.  Living  networks 

Living  networks  have  static  intraspecies  ARs  that  link  two  distinct  but  interacting 
parts  of  the  same  organism  in  terms  of  mass,  with  mass  serving  as  a  measure  of 
size.  Smith^^^  maintained  that  concentrating  on  a  power  function  as  the  method 
for  evaluating  the  biological  consequences  of  size  has  masked  the  complexity  ofthe 
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allometry  problem.  We  agree  with  this  observation,  but  perhaps  in  ways  that  Smith 
would  not  have  anticipated,  as  will  become  evident. 

2.1.1.  Biology 

Cuvier^^  was  the  first  to  recognize  that  brain  mass  increases  more  slowly  than 
TBM  as  we  proceed  from  small  to  large  species  within  a  taxon.  This  empirical 
observation  was  subsequently  made  between  many  other  biological  observables  and 
was  first  expressed  mathematically  as  an  allometric  relation  by  Snell^^^: 

brainweight  =  a  (body  weight)^  ,  (11) 

where  on  log-log  graph  paper  a  is  the  intercept  with  the  vertical  axis  and  b  is  the 
slope  of  the  line  segment.  Mammalian  neocortical  quantities  Y  have  subsequently 
been  empirically  determined  to  change  as  a  function  of  neocortical  grey  matter 
volume  X  as  an  AR.  The  neocortical  allometry  exponent  was  first  measured  by 
Tower^^^  for  neuron  density  to  be  approximately  —1/3.  The  total  surface  area  of 
the  mammalian  brain  was  found  to  have  an  allometry  exponent  of  approximately 
g^g  94,100,167  c^angizi^^  points  out  that  the  neocortex  undergoes  a  complex  trans- 
formation covering  the  five  orders  of  magnitude  from  mouse  to  whale  depicted  in 
Fig.  1  but  the  ARs  persist;  those  mentioned  here  along  with  many  others. 


Body  miss  (kg) 

Fig.  1.  Mouse  to  elephant  curve.  BMR  of  mammals  and  birds  are  plotted  versus  TBM  on  log-log 
graph  paper.  The  solid  line  sement  is  the  best  linear  regression  of  Eq.  (2)  to  the  data  with  a  slope 
very  close  to  3/4.  [reproduced  from  Schmidt-Nielsen-*^^^  with  permission]. 
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Another  quantity  of  interest  is  the  time;  not  the  chronological  time  measured  by 
a  clock  but  the  intrinsic  time  of  a  biological  process  first  called  biological  time  by 
Hill.^^  Hill  reasoned  that  since  so  many  properties  of  an  organism  change  with  size 
that  time  itself  may  scale  with  TBM.  Lindstedt  and  Calder^^^  develop  this  concept 
further  and  determine  experimentally  that  biological  time,  such  as  species  longevity, 
satisfies  an  AR  with  Y  being  the  biological  time.  Lindstedt  et  al}'^^  clarify  that 
biological  time  r  is  an  internal  mass-dependent  time  scale 

r  =  aM^  (12) 

to  which  the  duration  of  biological  events  are  entrained.  They  present  a  partial  list 
of  such  events  that  includes  breath  time,  time  between  heart  beats,  blood  circulation 
time  and  time  to  reach  sexual  maturity.  In  all  these  examples  and  many  others  the 
allometry  exponent  clusters  around  the  theoretical  value  h  =  1/4.  Note  that  the 
total  energy  of  an  organism  seen  as  a  bioreactor  is  proportional  to  volume  (M)  and 
the  biological  time  is  proportional  to  M^/^,  so  the  metabolic  rate  (energy /time) 
would  scale  as  M^/^. 


2.1.2.  Botany 

Niklas^^^  shows  in  Fig.  2  an  impressive  statistical  trend  spanning  twenty  orders  of 
magnitude  in  the  mass  of  aquatic  and  terrestrial  nonvascular  and  vascular  plant 
species.  The  annual  growth  in  plant  body  biomass  Gt  (net  annual  gain  in  dry 
mass  per  individual)  and  Mt  (total  dry  mass  per  individual)  are  related  by  the 
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Fig.  2.  Log-log  bivariate  plot  of  total  annual  growth  rate  in  dry  body  mass  per  individual  Gt 
versus  TBM.  Line  segment  denotes  reduced  major  axis  regression  curves  for  the  entire  data  set. 
[adapted  from  Niklas"*^^^  with  permission]. 
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empirical  AR: 

Gt  oc  M^/"^  .  (13) 

Figure  2  shows  linear  regression  on  logarithmically  transformed  data 

log  Gt  =  log  a  +  0.75  log  Mr  (14) 

and  the  parameter  a  is  the  intercept  with  the  vertical  axis.  In  the  data  analyses 
recorded  herein  the  terms  weight,  mass  and  volume  are  used  almost  interchange- 
ably as  measures  of  size.  The  allometry  exponent  is  3/4  for  the  data  in  Fig.  2  but 
empirically  differs  from  this  value  when  the  data  sets  are  graphed  individually.  The 
allometry  coefficients  of  the  separate  data  sets  may  vary  as  a  function  of  habi- 
tat as  well.  The  agreement  between  the  biomass  data  and  the  AR  with  exponent 
3/4  is  very  suggestive  but  it  must  be  viewed  critically  because  of  methodological 
limitations. 

Reich  et  al}^^  analyzed  data  for  approximately  500  observations  of  43  perennial 
plant  species  of  coupled  measurements  of  whole-plant  dry  mass  and  Gt  from  four 
separate  studies.  Collectively,  the  observations  span  five  of  the  approximately  12 
orders  of  magnitude  of  size  in  vascular  plants. The  result  of  each  experiment 
yielded  an  isometric  scaling  of  6  ^  1  and  not  h  =  3/4  as  did  the  scaling  of  Gt  to 
TBM  for  whole  plants.  Consequently,  even  when  data  look  as  appealing  as  they  do 
in  Fig.  2  things  are  not  always  what  they  seem. 


2.1.3.  Clearance  curves 

Another  allometry  phenomenon  is  the  dependence  of  drug-dosing  range  on  TBM 
and  is  referred  to  as  clear ance.^^  Zenobiotic  clearance  is  the  rate  at  which  any  foreign 
compound  not  produced  by  an  organism's  metabolism  is  passed  from  the  organism. 
The  application  of  allometry  ideas  to  pharmacokinetics  and  to  determining  human 
parameters  from  those  in  animals  is  fairly  recent. ^^'^^^'^^^  The  early  studies  did 
not  address  questions  of  variability  in  the  allometry  parameters  and  were  primarily 
concerned  with  whether  the  allometry  exponent  more  closely  tracked  the  value  2/3 
or  3/4  by  doing  linear  regression  on  log-transformed  data.^^^ 

Hu  and  Hayton^''  addressed  the  possible  impact  of  statistical  variability  in  the 
AR  parameters  on  the  predicted  pharmacokinetic  parameter  values.  They  found 
considerable  uncertainty  in  the  value  of  the  allometry  exponent,  which  they  fit  to  a 
Gaussian  distribution  with  mean  value  0.74.  Even  though  they  could  not  determine 
whether  the  variability  in  the  allometry  exponent  was  due  to  experimental  error 
or  to  biological  mechanisms  they  did  find  that  there  was  no  systematic  deviation 
from  the  AR.  However  it  appears  that  whether  h  =  3/4  or  2/3  depends  on  which 
of  the  major  elimination  pathways  is  used,  metabolism  for  the  3/4  value  and  renal 
excretion  for  the  2/3  value. 
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2.1  A.  Physiology 

The  most  studied  of  the  interspecific  ARs  does  not  concern  relative  growth  but  is 
that  associating  the  average  BMR  measured  in  watts  to  average  TBM  measured  in 
kilograms  of  multiple  species  such  that: 

B  =  aM\  (15) 

The  metabolic  rate  refers  to  the  total  utilization  of  chemical  energy  for  the  genera- 
tion of  heat  by  the  body  of  an  animal  and  is  often  measured  by  the  oxygen  intake 
during  respiration. 

The  earliest  physiologic  model  for  the  value  of  the  allometry  exponent  in  the 
intraspecies  AR  was  given  by  Sarrus  and  Rameaux.^^^  Schmidt-Nielsen^^^  records 
that  this  team  of  a  mathematician  and  a  physician  reasoned  that  the  heat  gener- 
ated by  a  warm  blooded  animal  is  proportional  to  the  volume  and  the  heat  loss 
is  proportional  to  the  animal's  free  surface.  Experiments  on  dogs  by  Rubner^^^ 
supported  their  argument  and  lead  to  the  wide  acceptance  of  the  "surface  law"  in 
which  6  =  2/3.  In  Fig.  1  the  "mouse-to-elephant"  curve  depicts  the  BMR  for  mam- 
mals and  birds  plotted  versus  TBM  on  log-log  graph  paper  spanning  six  orders  of 
magnitude.  The  solid  line  segment  is  the  fit  of  the  AR  to  the  data  and  yields  the 
empirical  value  b  ~  3/4. 

As  Niklas^^^  noted  the  expectation  was  that  the  metabolic  rate  would  be  pro- 
portional to  the  2/3  power  of  the  TBM  as  prescribed  by  the  surface  law.  Surpris- 
ingly, this  turned  out  not  to  be  the  case.  The  research  of  Kleiber^^^  and  Brody^^ 
revealed  that  the  slope  was  closer  to  3/4  than  to  2/3.  Subsequent  observational 
studies  have  reinforced  the  allometric  pattern  observed  in  the  data  predicted  by 
Eq.  (15)  including  some  relating  the  3/4-rule  to  plants,  see,  for  example,  Hem- 
mingsen.^^  Consequently  the  phenomenological  value  of  the  allometry  exponent  b 
remains  controversial.^^'^^'^^ 

Controversy  also  persists  regarding  the  theoretical  explanation  as  to  why  the 
allometry  exponent  b  should  have  a  specific  value.  The  simple  geometrical  argument 
of  Sarrus  and  Rameaux  suggests  b  =  2/3  as  reviewed  in  a  number  of  excellent 
sources. ^^'^'''^^  On  the  other  hand,  the  quarter-power  AR  is  explained  by  West, 
Brown  and  Enquist  (WBE)  using  geometric  scaling  arguments  from  fractal  physics 
to  establish  the  value  6  =  3/4  and  other  quarter-power  scaling  laws  in  physiology, 
see  Sec.  4. 

Heusner^^  adopted  geometric  scaling  arguments  to  obtain  b  =  2/3  in  the  AR 
between  BMR  and  TBM.  He  argued  that  the  various  other  values  experimentally 
observed  for  the  power-law  index  by  investigators  are  a  consequence  of  differing 
values  of  the  allometric  coefficient  a.  He  reasoned  that  two  or  more  data  sets  with 
6  =  2/3  but  with  different  values  of  a  graph  as  parallel  line  segments  on  log-log 
graph  paper,  but  when  the  two  or  more  data  sets  are  grouped  together  and  analyzed 
as  a  single  data  set  the  aggregate  is  fit  by  a  single  line  segment  with  net  slope 
b  >  2/3.  The  same  argument  can  be  found  in  a  number  of  other  references. ^^^'^^^ 
But  unlike  those  earlier  references  Heusner^^  concluded  that  it  is  the  allometry 
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coefficient  tiiat  remains  the  central  mystery  of  allometry  and  not  the  allometry 
exponent.  We  investigate  the  implications  of  Heusner's  conjecture  in  Sec.  3  where 
we  numerically  explore  the  implications  of  treating  the  allometry  coefficient  as  a 
random  variable. 

It  is  useful  to  list  the  various  forms  of  physiological  allometry  given  by  Lindstedt 
and  Schaeffer^^^:  pulmonary  and  cardiac  allometry  with  b  =  —1/4;  renal  allometry 
with  b  =  —0.85;  liver  allometry  with  b  =  —0.85;  pulmonary  blood  volume  with 
b  =  1.0;  cardiac  output  with  6  =  3/4  and  pulmonary  transit  times  with  6  =  1/4. 


2.1.5.  Information  transfer 

There  are  literally  dozens  of  physiologic  ARs  for  physiologic  time  r,  relative  to  clock 
time  t,  that  increases  with  increasing  body  size  r  =  aM^^^^  and  describes  chemical 
processes  such  as  the  turnover  time  for  glucose  with  b  =  1/4^  to  the  life  span  of 
various  animals  in  captivity  with  b  =  0.20.^^^  Schmidt-Nielsen^^^  explains  how  a 
variety  of  physiologic  time  scales  such  as  the  length  of  a  heart  beat  and  respiration 
all  scale  with  body  size  and  from  that  deduce  a  number  of  interesting  relations. 
It  is  only  recently  however  that  Hempleman  et  al.^^  hypothesized  a  mechanism  to 
explain  how  information  about  the  size  of  an  organism  is  communicated  to  the 
organs  within  the  organism.  Their  hypothesis  involved  matching  the  neural  spike 
code  to  body  size  to  convey  this  information. 

Hempleman  et  al.^^  suggest  that  mass-dependent  scaling  of  neural  coding  may 
be  necessary  for  preserving  information  transmission  with  decreasing  body  size. 
They  point  out  that  action  potential  spike  trains  are  the  mechanisms  for  long 
distance  information  transmission  in  the  nervous  system.  They  go  on  to  say  that 
neural  information  may  be  "rate  coded"  with  average  spike  rate  over  a  time  period 
encoding  stimulus  intensity  or  "time  coded"  with  the  occurrence  of  a  single  spike 
encoding  the  occurrence  of  a  rapid  stimulus  transition.  The  hypothesis  is  that  some 
phasic  physiological  traits  are  sufficiently  slow  in  large  animals  to  be  neural  rate 
coded,  but  are  rapid  enough  in  small  animals  to  require  neural  time  coding.  These 
trait  include  such  activities  as  breathing  rates  that  scale  with  b  =  —1/4. 

They  tested  for  this  allometry  scaling  of  neural  coding  by  measuring  action  po- 
tential spike  trains  from  sensory  neurons  that  detect  lung  CO2  oscillations  linked  to 
breathing  rate  in  birds  ranging  in  body  mass  from  0.045  to  5.23  kg.  While  it  is  well 
known  that  spike  rate  codes  occur  in  the  sensing  of  low  frequency  signals  and  spike 
timing  codes  occur  in  the  sensing  of  high  frequency  signals,  their  experiment  was 
the  first  designed  to  test  the  transition  between  these  two  coding  schemes  in  a  single 
sensory  network  due  to  variation  in  body  mass.  The  results  of  their  experiments 
on  breathing  rate  was  an  allometry  exponent  in  the  interval  —0.26  <  b  <  —0.23 
and  although  taken  on  a  small  number  of  birds  their  results  do  suggest  a  preserva- 
tion of  information  transmission  rates  for  high  frequency  signals  in  intrapulmonary 
chemoreceptors  and  perhaps  other  sensory  neurons  as  well.  The  implications  of 
these  experiments  strongly  suggest  the  need  to  continue  such  investigations. 
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On  the  more  theoretical  side  Moses  et  al}"^^  apply  the  scaling  ideas  of  metabolic 
allometry  developed  by  West  et  al?"^^  to  information  networks  consisting  of  micro- 
processors to  form  a  network.  Moses  et  al}^^  use  a  fractal  argument  to  construct  a 
two-dimensional  hierarchal  self-similar  branching  network,  an  H-tree  that  Mandel- 
brot^^^  originally  used  in  his  discussion  of  the  space  filling  behavior  of  the  human 
lung.  They  show  that  this  branching  network  of  microprocessors  have  a  striking 
similarity  to  such  networks  in  organisms  even  though  the  latter  has  evolved  by 
natural  selection  and  the  former  are  designed  by  engineers. 

Along  this  same  line  E.  F.  Rent,  while  an  IBM  employee  in  the  1960s,  wrote  a 
number  of  internal  memos  (unpublished)  relating  the  number  of  pins  at  the  bound- 
aries of  an  integrated  circuit  (X)  to  the  number  of  internal  components  (F),  such 
as  logic  gates,  to  obtain  an  AR  with  6  <  1.0.  This  rule  has  historically  been  used  by 
engineers  to  estimate  power  dissipation  in  interconnections  and  for  the  placement 
of  components  in  very  large  scale  integrated  (VLSI)  circuit  design.  More  recently 
Rent's  Rule  has  been  used  to  model  information  processing  networks  in  the  human 
brain^^  where  the  mass  of  grey  and  white  matter  are  shown  to  satisfy  an  AR  as  first 
noted  by  Schlenska.-*^^^  Beiu  and  Ibrahim-*^^  suggested  that  the  allometry  exponent 
for  grey  and  white  matter  between  species  is  identical  to  the  Rent  exponent  within 
a  species  and  this  was  supported  using  MRI  data  by  Bassett  et  al}^ 

2.2.  Physical  networks 

Some  of  the  oldest  ARs  involve  physical  networks,  or  more  specifically  geophys- 
ical networks.  The  skeptic  need  only  return  to  da  Vinci's  scaling  relation.  In  his 
notebooks  da  Vinci  explains  the  meaning  of  this  equation  not  only  in  the  context 
of  relating  tree  trunks  to  subsequent  branches,  but  to  the  branchings  of  rivers  as 
well.  Long  before  the  conservation  of  energy  and  the  continuity  of  fiuid  fiow  were 
known  to  scientists,  the  enigmatic  Italian  painter,  sculptor,  military  engineer  and 
anatomist  understood  the  basics  of  hydrologic  networks. 

2.2.1.  Geology  and  geomorphology 

Horton's  law  of  river  numbers  is  another  empirical  regularity  observed  in  the  topol- 
ogy of  river  networks. As  observed  by  Scheidegger^^^  the  number  of  river  segments 
in  successive  order  form  a  geometrical  sequence  such  that,  the 

number  of  rivers  with  k  tributaries  ex  R\~^  -  (16) 

The  bifurcation  parameter  is  the  constant  ratio  between  successive  numbers  of 
river  networks,  known  as  Horton's  law  of  stream  numbers  nk/rtk+i  =  and  has 
the  empirical  value  between  4.1  and  4.7  in  natural  river  networks, ^^^'^^^  in  contrast 
to  the  random  model  that  predicts  a  value  of  four.  Note  that  the  system  of  counting 
begins  at  the  smallest  tributary  that  are  the  most  numerous  i?^  >  1  and  these  feed 
into  larger  tributaries  that  are  fewer  in  number. 
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Dodds  and  Rothman^^  point  out  that  universality  arises  when  the  quahtative 
character  of  a  network  is  sufficient  to  quantify  its  essential  features,  such  as  the 
exponents  that  characterize  scaling  laws.  They  go  on  to  say  that  scaling  and  uni- 
versality have  found  application  in  the  geometry  of  river  networks  and  the  statistical 
structure  of  topography  within  geomorphology.  They  maintain  that  the  source  of 
scaling  in  river  networks  and  whether  or  not  such  scaling  belongs  to  a  single  uni- 
versality class  is  not  yet  known.  They  do  provide  a  critical  analysis  of  Hack's  law, 
see  also,  Rodriguez-Iturbe  and  Rinaldo.^^^ 

2.2.2.  Hydrology 

Hack's  law  is  a  hydrologic  AR  having  to  do  with  the  drainage  basins  of  rivers.  Hack^^ 
developed  an  empirical  relation  between  mainstream  length  of  a  river  network  and 
the  drainage  basin  area  at  the  closure  of  the  river.  Hack's  law  is  given  by, 

mainstreamlength  oc  (area)^  ,  (17) 

where  h  is  the  Hack  exponent  with  the  typical  empirical  value  h  ~  0.57.  Hack 
asserted  that  river  networks  are  not  self-similar. 

Mandelbrot^^^  relates  Hack's  exponent  to  the  fractal  dimension  of  the  river  net- 
work and  debates  the  interpretation  of  the  equation.  Feder^^  observed  that  defining 
a  fractal  dimension  for  river  networks  was  obscure  and  required  further  study.  A 
modern  version  of  this  discussion  in  terms  of  hydrologic  allometry  is  given  by  Ri- 
naldo  et  al}^^  who  point  out  that  optimal  channel  networks  yield  h  =  0.57  ±  0.02 
suggesting  that  feasible  optimality^^^  implies  Hack's  law.  Another  viable  model  is 
given  by  Sagar  and  Tein^^^  that  is  geomorphology  realistic  giving  rise  to  general 
ARs  in  terms  of  river  basin  areas,  as  well  as  parallel  and  perpendicular  channel 
lengths. 

Maritan  et  al}^^  consider  an  analogy  with  the  metabolic  AR  using  M  oc  5", 
that  is,  a  =  1/6  so  that  a  =  1  -\-  h  with  the  limiting  values  a  =  3/2  and  h  =  1/2 
in  the  case  of  geometric  self-similarity.  Geometric  self-similarity  is  the  preservation 
of  the  river's  shape  as  the  basin  increases  in  area.  The  observed  values  lie  in  the 
range  1.5  <  a  <  1.6  and  the  scatter  of  individual  curves  (analog  of  intraspecies 
data)  is  remarkably  small.  These  values  suggest  that  the  branching  nature  of  rivers 
is  fractal,  that  is,  a  >  3/2  in  most  cases.  The  ensemble  average  of  Hack's  exponents 
from  different  basins  extend  over  11  orders  of  magnitude  and  is  indistinguishable 
from  h  =  \/2}^^  Maritan  et  al}^^  conclude  that  like  the  interspecies  metabolic  rate, 
the  slope  of  the  intraspecies  /I's  are  washed  out  in  the  ensemble  average,  resulting 
in  the  value  /i  =  1/2. 

2.3.  Natural  History 

Natural  History  embraces  the  study,  description  and  classification  of  the  growth 
and  development  of  natural  phenomena.  The  focus  of  investigation  includes  such 
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important  contemporary  areas  as  ecology  and  paleontology,  parts  of  which  rely 
heavily  on  AR  and  scaling. 

2.3.1.  Ecology 

Ecology  is  the  scientific  study  of  the  distribution,  abundance  and  relations  of  or- 
ganisms and  their  interactions  with  the  environment.  Such  living  networks  include 
both  plant  and  animal  populations  and  communities  along  with  the  network  of  re- 
lations among  organisms  of  different  scales  of  organization.  Of  concern  to  us  here 
are  the  species  traits  determined  by  body  size  and  how  these  in  turn  affect  food 
web  stability.  Woodward  et  al?^^  along  with  many  others  point  out  that  the  largest 
metazoans,  for  example,  whales  (10^  grams)  and  giant  sequoias,  weigh  over  21  or- 
ders of  magnitude  more  than  the  smallest  microbes  (10~^^  grams). ^^^'^^^  They  go 
on  to  stress  the  considerable  variation  in  body  mass  among  members  of  the  same 
food  web. 

We  do  not  address  the  traditional  linking  of  species  through  such  mechanisms  as 
the  prey-predator  relation  within  food  webs^^  but  instead  we  focus  on  body  size. 
The  significance  of  body  size  has  been  systematically  studied  in  ecology. ^^'^^'^^^ 
Indentifying  X  with  species  abundance  and  Y  with  TBM  in  Eq.  (2)  there  is,  in 
fact,  an  AR  between  the  species  at  the  base  of  a  food  web  and  the  largest  predator 
at  the  top.^^  We  note  that  species-area  power  functions  have  a  vital  history  in 
ecology^^^'^^^  even  though  the  domain  of  sizes  over  which  the  power  law  appears 
valid  is  controversial. ^^'^^^ 

Woodward  et  al?^^  emphasize  that  AR  has  been  used  to  explain  the  observed 
relations  between  body  size  and  each  of:  home  range  size,  nutrient  cycling  rates, 
numerical  abundance  and  biomass  production.  They  speculate  that  body  size  may 
capture  a  suite  of  covarying  species  traits  into  a  single  dimension,  without  the 
necessity  of  having  to  observe  the  traits  directly. 

Brown  et  al?^  discuss  the  universality  of  the  documented  ARs  in  plants,  ani- 
mals and  microbes;  to  terrestrial,  marine  and  freshwater  habitats;  and  to  human- 
dominated  as  well  as  "natural"  ecosystems.  They  emphasize  that  the  observed 
self-similarity  is  a  consequence  of  a  few  basic  physical,  biological  and  mathematical 
principles;  one  of  the  most  fundamental  being  the  extreme  variability  of  the  data. 
The  variety  of  distributions  of  allometry  coefficients  and  exponents  are  discussed 
both  phenomenologically  and  theoretically  in  subsequent  sections. 

Farr's  law^^  is  an  example  of  the  change  in  ARs  in  the  transition  from  organ- 
ismic  to  environmental  allometry.  Farr  collected  data  on  the  number  of  patients 
committed  because  of  their  mental  condition  and  their  mortality  from  a  variety  of 
asylums  in  1830s  England. From  these  humble  beginnings  he  was  able  to  sum- 
marize the  "evil  effect  of  crowding"  into  a  relation  between  mortality  rate  R  and 
population  density  p^^: 


R  =  apK 


(18) 
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Here  we  see  that  the  size  measure  used  in  the  metabohc  AR,  the  mass,  is  replaced 
with  a  measure  of  community  structure,  the  population  density.  The  ARs  that  cap- 
ture life  histories  in  ecology  and  sociology  are  often  expressed  in  terms  of  numbers 
of  animals  and  areas  in  addition  to  body  mass.  Calder^^  points  out  that  size  and 
time  seem  to  be  the  principle  characteristics  of  life  history  and  ecology. 

The  factors  necessary  for  the  formation  of  social  groups  with  a  restricted 
geographic  area  are  not  understood,  but  certain  ARs  help  clarify  them.  The  popu- 
lation density  p  of  herbivorous  mammals  has  been  determined  to  be  related  to  their 
mass  M  by^^  =  cM^-^^  where  the  animals  freely  roam  on  a  "home  range"  of 
given  area:  Ahr  =  c'M^-^^.  A  similar  relation  exists  for  carnivores. Consequently, 
as  explained  by  Calder,^^  herbivorous  mammals  above  a  certain  size  range  over  an 
area  greater  than  their  per  capita  share  of  the  local  habitat.  The  degree  of  overlap 
was  empirically  determined  to  be^^  ^hrP  =  c^^M^*^^,  where  the  empirical  exponent 
0.34±0.11  is  statistically  indistinguishable  from  that  obtained  by  combining  the  sep- 
arate exponents  for  the  population  density  and  area,  that  being,  1.02  —  0.75  =  0.27. 
Calder  conjectures  that  the  greater  the  product  AhrP  the  greater  the  intensity  of 
competition  and  the  greater  the  desirability  of  social  networks  that  contribute  to 
mutual  tolerance  within  these  groups.  Makarieva  et  al}'^^  argue  that  animal  home 
range  represents  a  biological  footprint  of  the  undisturbed  state  of  an  econetwork, 
however  the  population  density  adapts  to  disturbances  in  the  econetwork.  Conse- 
quently, the  deviation  of  the  home  range-population  product  from  isometry  reflects 
the  degree  of  econetwork  disturbance. 

The  AR  between  maximum  abundance  and  body  size  for  terrestrial  plants  N  oc 
was  extended  by  Belgrano  et  al}^  to  the  maximum  population  densities  of 
marine  phytoplankton  with  h  =  3/4.  They  draw  the  implication  that  maximum 
plant  abundance  is  constrained  by  rates  of  energy  supply  in  both  terrestrial  and 
marine  networks  as  dictated  by  a  common  AR.  Earlier  investigators  found  h  > 

3/4^37,49 

2.3.2.  Zoology  and  acoustics 

Mice  squeak,  birds  chirp  and  elephants  trumpet  due  to  scaling.  Fitch^^  discusses  the 
relationship  between  an  organism's  body  size  and  acoustic  characterization  of  its 
vocalization  under  the  rubric  of  acoustic  allometry.  Data  indicate  an  AR  between 
palate  length  (the  skeletal  proxy  for  vocal  tract  length)  and  body  mass  for  a  variety 
of  mammalian  species.  He  shows  that  the  interspecies  allometry  exponent  attains 
the  geometric  value  of  three  in  the  regression  of  skull  length  and  body  mass,  whereas 
the  intraspecies  allometry  exponent  varies  a  great  deal.  The  significant  variability 
in  the  intraspecies  allometry  exponent  suggest  taxon-specific  factors  influencing  the 

AR.137,199 

Fitch^^  gives  the  parsimonious  interpretation  that  the  variability  in  the  in- 
traspecies allometry  exponent  could  be  the  result  of  each  species  adopting  allomet- 
ric  scaling  during  growth  as  postulated  by  Huxley,  with  a  different  proportionality 
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factor  for  each  species.  On  the  other  hand,  the  interspecies  ahometry  exponent 
could  result  from  the  common  geometric  constraints  across  species  due  to  the  wide 
range  of  body  sizes.  He  concludes  that  the  AR  between  vocal  tract  dimensions 
and  body  size  could  provide  accurate  information  about  a  vocalizer's  size  in  many 
mammals. 

2.3.3.  Paleontology 

Pilbeam  and  Gould^^^  provide  reasons  as  to  why  body  size  has  played  such  a 
significant  role  in  biological  macroevolution.  The  first  is  the  statistical  generalization 
known  as  Cope's  Law  or  Rule,  which  states  that  population  lineages  increase  in 
body  size  over  evolutionary  time  scales, that  is,  the  body  size  of  a  species  is  an 
indication  of  how  long  it  has  survived  on  geological  time  scales.  A  second  reason  is 
the  one  mentioned  earlier,  Galileo's  observation  that  large  organisms  must  change 
shape  in  order  to  function  in  the  same  way  as  do  their  smaller  prototypes. 

One  quantitative  measure  of  evolution  is  the  development  of  the  brain  in  mam- 
mals at  various  stages  of  evolution.  Jerison^^  showed  that  the  brain-body  relation 
given  by  Eq.  (2)  is  satisfied  by  mammals  with  an  exponent  that  is  statistically 
indistinguishable  from  2/3.  He  suggested  that  a  may  be  an  appropriate  measure 
of  brain  evolution  in  mammals  as  a  class,  following  the  proposal  of  Dubois^^  that 
a  quantitative  measure  of  cephalization  in  contemporary  mammals  be  based  on 
the  ratio;  a  =  brain  weight/(body  weight)^.  These  hypotheses  were  directly  tested 
by  Jerison^^  using  endocranial  volumes  and  body  volumes  for  fossil  mammals 
at  early  and  intermediate  evolutionary  stages.  The  data  did  in  fact  support  the 
hypothesis. 

White  and  Gould^^^  emphasize  in  their  review  of  the  meaning  of  the  allome- 
try  coefficient  a  that  it  had  generated  a  large  and  inconclusive  literature.  Reiss^^^ 
notes  that  if  brain  mass  is  regressed  on  TBM  across  individuals  in  a  species  the 
slopes  are  shallower  than  of  regressions  calculated  across  mean  values  for  different 
species  within  a  single  family  (genus).  This  argument  had  also  been  presented  by 
Gould^^  who  emphasized  the  importance  of  the  allometry  coefficient  in  the  geomet- 
ric similarity  of  allometric  growth.  This  interpretation  of  the  allometry  coefficient 
was  at  odds  with  the  belief  of  the  majority  of  the  scientific  community  at  the  time 
that  the  allometry  coefficient  was  independent  of  body  size.  This  latter  view  is  also 
contradicted  by  the  data  analysis  in  Sec.  3. 

Allometry  has  been  used  by  Alberch  et  al?  as  the  first  step  in  creating  a  uni- 
fying theory  in  developmental  biology  and  evolutionary  ecology  in  their  study  of 
morphological  evolution.  They  demonstrate  how  their  proposed  formalism  relate 
changes  in  size  and  shape  during  ontogeny  and  phylogeny. 

3.  Data  Statistics 

In  this  section  the  phenomenology  of  analyzing  AR  measurement  using  statistics 
is  discussed;  collecting  data,  identifying  patterns  (laws)  in  the  data  and  developing 
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methods  of  statistical  analysis.  Warton  et  al?'^^  point  out  that  fitting  a  line  to  a 
bivariate  data  set  is  not  a  simple  task  and  the  AR  literature  is  filled  with  debate 
over  the  proper  methodology.  Sir  Julian  readily  adopted  the  statistical  approach  of 
linear  regression  to  Eq.  (3)  on  multiple  data  sets  to  determine  the  allometry  coef- 
ficient a  and  exponent  h.  The  sophisticated  statistical  techniques  such  as  principle 
component  analysis  were  not  available  to  Sir  Julian  and  although  they  can  be  found 
in  the  modern  AR  literature  least-square  regression  still  appears  to  be  the  method 
of  choice. '^^'^^^ 

Measures  of  complex  phenomena  always  contain  uncertainty  and  the  data  sets 
invariably  fiuctuate  from  measurement  to  measurement.  Thus,  X{t)  and  Y{t)  are 
stochastic  variables,  and  by  implication  X{Y)  is  a  random  function  of  its  argu- 
ment as  well.  Consequently  the  deterministic  algebraic  equations  relating  network 
size  and  network  function  never  unambiguously  represent  what  is  actually  being 
measured.  Subsequently,  we  examine  the  corresponding  mathematical  modeling, 
shifting  the  focus  from  the  dynamic  stochastic  variable  to  the  associated  dynamic 

pdf. 

We  examine  the  phenomenology  of  the  random  data  from  measurements  of  var- 
ious properties  of  allometry  networks  that  determine  empirical  ARs.  In  particular 
we  focus  on  how  allometry  coefficients  and  exponents  are  interpreted  given  that  the 
data  on  which  they  are  based  fiuctuate  to  such  a  large  extent.  Part  of  the  reason 
for  taking  this  approach  is  that  a  significant  number  of  scientists  adopt  the  view- 
point that  fiuctuations  refiect  lack  of  control  and/or  ignorance  about  what  is  being 
measured.  Here  we  adopt  the  more  sympathetic  view  that  networks  are  generically 
random  because  they  are  dynamic  and  complex  in  which  case  statistics  provide 
information  about  the  fundamental  nature  of  that  complexity. 


3.1.  Fluctuations 

All  complex  dynamic  networks  are  stochastic,  either  due  to  intrinsic  nonlinear  dy- 
namics producing  chaos, ^^^'^^^'^^^  or  due  to  coupling  of  the  network  to  an  infinite 
dimensional  albeit  unknown  environment, or  both;  completely  aside  from  mea- 
surement errors.  Consequently,  it  is  necessary  to  understand  how  statistical  un- 
certainty may  be  included  in  modeling  allometry  data.  Kaitaniemi^^^  pointed  out 
that  the  potential  information  content  of  the  allometry  coefficient  has  been  largely 
neglected,  an  observation  also  made  by  Glazier^^  among  others.  Kaitaniemi  exam- 
ined the  different  ways  this  parameter  may  vary  for  different  sources  of  random 
ffuctuations.  Here  we  follow  a  similar  strategy,  but  we  use  actual  data  rather  than 
computer  generated  random  ffuctuations. 

The  normal  or  Gauss  pdf  suggests  that  the  statistical  variations  between  the 
variables  in  the  AR  Eq.  (2)  may  be  additive  leading  some  scientists'^' ^^^'^^^  to 
propose  the  form: 


(19) 


134 


D.  West  &  B.  J.  West 

where  r]  depicts  the  random  fluctuations  and  the  overbars  denote  the  fitted  values 
of  the  parameters.  Packard  and  Boardman^^^  also  investigate  the  regression  of  data 
to  a  three-parameter  power  law  that  does  not  pass  through  the  origin 

Y  =  Y^^aX^  ^T].  (20) 

In  the  near  isometry  case  where  6^1  linear  regression  analysis  is  appropriate 
and  additive  fluctuations  provide  a  satisfactory  representation  of  the  statistical 
variability.  On  the  other  hand  when  the  allometry  exponent  is  substantially  different 
from  unity  the  determination  of  the  nature  of  the  fluctuations  requires  preliminary 
statistical  analysis,  see,  for  example,  Packard. 

Packard  and  Boardman^^^  emphasize  that  the  additive  form  of  fluctuations  in 
ARs  is  quite  different  from  the  situation  involving  the  logarithmically  transformed 
data.  For  the  transformed  data  introducing  additive  random  fluctuations  yields: 

logF  =  loga  +  MogX  +  7^,  (21) 

and  the  empirical  constants  a  and  h  are  fit  to  the  transformed  data.  In  terms  of  the 
original  AR  we  obtain: 

Y  =  ae^X^  (22) 

with  the  fluctuations  being  exponentially  amplified  through  e^.  It  is  evident  that 
when  the  fluctuations  are  considered  to  be  focused  in  the  allometry  coefficient,  as 
they  are  here,  they  are  multiplicative.  The  multiplicative  character  of  the  fluctu- 
ations implies  that  the  influence  of  the  random  variations  is  amplified  far  beyond 
their  additive  cousins. 

They  emphasize  that  the  focus  of  the  research  on  log-transformed  data  is  to 
characterize  patterns  of  variation  in  morphology,  physiology  and  ecology  in  organ- 
isms spanning  a  broad  range  in  body  size  in  an  attempt  to  identify  underlying 
principles  in  the  design  of  biological  networks,  see,  for  example.  Brown  et  al?^  and 
references  therein.  They  go  on  to  assert  that  many  of  the  patterns  identified  by 
this  research  are  inaccurate  and  misleading  and  these  mischaracterizations  likely 
contribute  to  the  ongoing  debate  about  ways  in  which  animals  are  constructed. 

We  saw  that  fluctuations  in  the  log-transformed  data  may  be  equivalent  to  mul- 
tiplicative fluctuations  in  the  original  data.  So  the  important  question  is  whether  it 
is  necessary  to  perform  the  logarithmic  transformation  at  all.  Packard  and  Board- 
man^^^  point  out  that  the  original  motivation  for  the  log-transform  was  to  linearize 
the  equations  thought  to  represent  the  data  and  therefore  facilitate  the  implemen- 
tation of  graphical  and  statistical  analysis. ^^^'^^^  However,  they  go  on  to  show  the 
biasing  problems  associated  with  log-transforms  using  computer  generated  data 
sets  and  caution  that  with  the  present  day  computer  software  for  fitting  nonlinear 
equations  linearization  is  no  longer  a  sufficient  rationale  for  log- transforms.  So  are 
there  other  reasons  to  transform  the  data? 

Kerkhoff  and  Enquist^^^  strongly  disagree  with  the  conclusions  of  Packard 
(2008)  that  standard  methods  for  fitting  allometry  models  produce  "biased  and 


135 


On  Allometry  Relations 


misleading"  results.  They  point  out  that  most  biological  phenomena  are  inherently 
multiplicative^^'^^  and  it  is  the  proportional  rather  than  absolute  variation  that 
matters.  The  multiplicative  influence  of  noise  seen  in  the  log-transformed  data  is 
often  misinterpreted  as  bias.^^^'^^^  Kerkhoff  and  Enquist^^^  maintain  that  the  mul- 
tiplicative error  model  is  an  appropriate  feature,  rather  than  a  defect,  of  standard 
allometry  analysis.  Recent  research  suggests  that  geometric  error  resulting  from 
multiplicative  fluctuations  should  be  the  default  standard  for  parameter  estimation 
in  biology^^'^^  and  not  additive  error. 


3.2.  Phenomenological  distributions 

We  now  consider  the  statistics  of  the  fluctuations  in  the  AR  using  data  from  the 
literature.  The  data  relating  the  average  energy  expended  by  a  given  species  in  watts 
to  the  average  TBM  of  that  species  in  kilograms  for  391  species  of  mammal  is  plotted 
in  Fig.  3  and  also  in  Heusner^^  as  well  as  in  Dodds  et  al^^  A  flt  of  Eq.  (3)  to  these 
data  that  minimizes  the  mean-square  error  is  a  straight  line  on  double  logarithmic 
graph  paper  and  was  found  to  have  slope  b  =  0.71  ±  0.008  so  that  empirically 
2/3<6<3/4  and  the  allometry  coefficient  a  =  0.02.  As  West  and  West^^^  reviewed 
Heusner^^  had  somewhat  earlier  questioned  Kleiber's  value  of  3/4  and  concluded 
from  data  analysis  that  this  value  of  3/4  was  a  statistical  artifact.  Feldman  and 
McMahon^^  agreed  with  Heusner's  conclusions,  but  suggested  that  there  was  no 
compelling  reason  for  the  intraspecies  and  interspecies  allometric  exponents  to  be 
the  same,  with  the  intraspecies  exponent  being  2/3  based  on  geometric  similarity 
and  the  interspecies  exponent  being  3/4  based  on  McMahon's  elastic  similarity. 

There  is  a  great  deal  of  variability  around  the  line  segment  that  gives  the  AR 
model  in  Fig.  3.  West  and  West^^^  interpret  these  fluctuations  as  random  variations 


Fig.  3.  The  linear  regression  to  Eq.  (3)  for  Heusner's  data^-*^  is  indicated  by  the  hne  segment. 
The  slope  of  the  dashed  line  segment  is  0.71  ±  0.008. 
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in  either  the  ahometry  coefficient  or  the  allometry  exponent.  If  the  fluctuations  are 
assumed  to  be  contained  in  the  allometry  coefficient  we  can  define 

-  =  e^  =  A  (23) 

so  that  each  data  point  in  the  (X,  Fj-plane  yields  a  single  value  of  the  allometry 
coefficient.  In  a  complex  network  a  linear  response  such  as  suggested  by  Fig.  3  does 
not  necessarily  occur  since  there  can  be  independent  fluctuations  in  both  X  and 
Y  resulting  in  what  Warton  et  al?'^^  call  equation  error;  also  known  as  natural 
variability,  natural  variation  and  intrinsic  scatter.  In  the  present  case  the  AR  is  not 
predictive,  but  instead  summaries  vast  amounts  of  data.^^^  This  natural  variability 
is  manifest  in  fluctuations  in  the  allometry  parameters  (a,  6). 

West  and  West^^^  calculate  the  statistical  distribution  for  the  random  allometry 
coefficient  determined  from  Eq.  (23)  under  the  assumption  that  h  is  fixed.  The 
variability  in  the  allometry  coefficient  determined  by  the  data  is  partitioned  into 
20  equal  sized  bins  in  the  logarithm  of  the  allometry  coefficient.  A  histogram  is 
then  constructed  by  counting  the  number  of  data  points  within  each  of  the  bins 
as  indicated  by  the  dots  in  Fig.  4.  The  solid  line  segment  in  this  figure  is  the  best 
fit  to  these  20  numbers  with  minimum  mean-square  error.  The  functional  form 
for  the  histogram  is  indicated  by  the  curve  in  Fig.  4^41,242  ^j^^  quality  of  the 
fit  to  the  diversity  data  is  determined  by  the  correlation  coefficient  =  0.98. 
The  normalized  histogram  G(lna')  on  the  interval  (0,  oo)  using  the  transformation 


a 


Fig.  4.  The  histogram  of  the  deviations  from  the  prediction  of  the  AR  using  the  data  depicted 
in  Fig.  3  partitioned  into  20  equal  sized  bins  in  the  logarithm  of  the  normalized  variable  a'  =  a/d. 
Here  d  =  0.02  and  b  =  0.71.  The  solid  line  segment  is  the  best  fit  of  Eq.  (24)  to  the  20  histogram 
numbers,  which  yields  the  power-law  index  a  =  2.79  and  the  quality  of  the  fit  is  measured  by  the 
correlation  coefficient      =  0.98.  (Reproduced  from  West  and  West^^^  with  permission). 
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G{hia')dhia'  =  P{a')da'  gives  the  empirical  pdf: 

1 

(24) 


1 


and  a  =  3.28  yielding  a  standard  deviation  0.017  in  essentially  exact  agreement 
with  the  empirical  data.  Note  that  this  coefficient  differs  from  the  best  fit  value 
given  in  the  caption  of  Fig.  4  but  this  results  in  only  a  1%  change  in  the  value 
of  r^. 

The  same  inverse  power-law  form  is  obtained  with  a  —  3.89  and  r  =  0.96  using 
the  avian  BMR  data  of  McNab^^^  for  533  species  of  bird.  The  distribution  of  the 
deviations  from  the  AR  for  both  the  avian  and  mammalian  data  sets  fall  off  as 
inverse  power  laws  on  either  side  of  a  =  a.  Equation  (24)  quantifies  the  qualitative 
argument  used  earlier  to  associate  power-law  pti/'s  with  multiplicative  fluctuations, 
see  also  Sec.  5. 

Alternatively  the  fluctuations  can  be  assumed  to  be  contained  within  the  allom- 
etry exponent  as: 

^        logM  ^  ' 

If  we  assume  h  =  0.71  and  a  =  a  then  Eq.  (25)  provides  us  with  the  statistical 
fluctuations  in  the  allometry  exponent  are  used  to  construct  a  histogram  exactly  as 
we  did  previously.  The  solid  line  segment  in  Fig.  5  is  the  best  fit  to  the  20  numbers 
of  the  histogram  with  minimum  mean-square  error.  The  functional  form  for  the 
histogram  of  deviations  from  the  allometry  exponent  h  is  determined  by  the  curve 
in  Fig.  5  and  the  quality  of  the  fit  to  the  histogram  is  determined  by  the  correlation 
coefficient      =  0.97.  The  histogram  is  fit  by  the  Laplace  pdf 

vl/(6)  =  ^exp[-/?|6-5|],  (26) 
with  the  empirical  value  /3  =  12.85. 


3.2.1.  Co-variation  of  allometry  coefficient  and  exponent 

We  have  obtained  two  separate  distributions  for  two  different  parametric  representa- 
tions of  the  same  data.  As  pointed  out  by  Glaizer ''^  species  within  a  taxon  represent 
a  cloud  of  different  metabolic  rates  and  not  the  linear  metabolic  level  determined  by 
linear  regression.  His  cloud  is  incorporated  into  the  present  context  by  abandoning 
the  assumption  that  the  allometry  coefficient  and  exponent  are  independent  and 
requiring  that  the  probability  of  a  given  fluctuation  is  the  same  regardless  of  the 
representation  so  that  V{a)da  =  ^{h)dh.  In  order  to  calculate  a  nonzero  Jacobian 
of  the  transformation  between  the  two  allometry  parameters  requires  that  they  be 
functionally  related.  West  and  West^^^  assume  h  =  6  — clna,  so  that  by  inserting  the 
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Fig.  5.  The  histogram  of  the  deviations  from  the  prediction  of  the  AR  using  the  allometry 
exponent  fluctuation  data  partitioned  into  20  equal  sized  bins.  The  sohd  hne  segment  is  the  best 
flt  of  Eq.  (26)  with  A6  =  6  — 6,  to  the  20  histogram  numbers,  and  the  quahty  of  the  flt  is  measured 
by  the  correlation  coeflicient      =  0.97.  (Reproduced  from  West  and  West^^^  with  permission). 


allometry  exponent  pdf  Eq.  (26)  into  that  for  equal  probabilities  and  simplifying 
yields: 

^    ,  a^^~^      for  a  <a 
pc 

T 


V{a) 


1 


for  a>  a 


(27) 


and  comparing  Eq.  (27)  with  Eq.  (24)  they  identify  a  =  13c  and  obtain  the  empirical 
pdf  for  the  allometry  coefficient  P{a')da'  =  V{a)da.  Using  the  fitted  values  a  =  2.79 
and  /3  =  12.85  yields  c  =  0.217  and  consequently  the  empirical  transformation  can 
be  written  as: 


6  =  0.71 -0.50  log 


10' 


(28) 


Note  that  this  functional  dependency  of  the  allometry  exponent  on  the  allometry  co- 
efficient is  consistent  with  the  empirical  co- variation  relation  obtained  by  Glazier 
who  used  linear  regression  on  large  amounts  of  parametric  data.  West  and  West^^^ 
emphasize  that  this  co- variation  of  the  allometry  parameters  implies  that  there  is 
no  universal  value  for  the  allometry  exponent. 


3.2.2.  Other  scaling  distributions 

The  tent  shaped  distribution  in  Fig.  5  also  arises  using  a  different  approach  to 
quantifying  the  variability  of  BMR.  Labra  et  al}^^  investigate  BMR  fluctuations 
by  considering  O2  volume  time  series  and  examining  the  scaling  of  the  high  fre- 
quency fluctuations  across  species.  They  determined  empirically  that  the  standard 
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deviation  in  the  BMR  is  proportional  to  the  average  BMR  by  the  scahng  law: 

SDbmr 

and  A  =  —0.352  zb  0.072.  This  relation  is  known  as  the  power  curve  in  the  ecology 
liter ature^^  and  was  discovered  by  Taylor^ in  his  determination  of  the  number  of 
new  species  that  can  be  found  in  a  given  plot  of  ground.  It  was  expanded  to  the 
determination  of  scaling  in  time  series  by  Taylor  and  Woiwold^^^  and  applied  in 
physiology  as  Taylor's  law  in  West.^^^ 

On  the  other  hand  Labra  et  al}^^  determined  the  standard  deviation  of  BMR 
to  be  proportional  to  a  power  of  the  TBM: 

SDbmr  cx 

and  7  =  — 0.24 1±0. 103.  Consequently,  combining  the  two  empirical  expressions  for 
the  variance  we  obtain: 

B  oc  M^/^  (29) 

with  7/A  =  0.69  a  value  consistent  with  the  many  fits  made  to  the  allometry  data. 
They  determine  that  all  the  species  they  studied  show  the  same  invariant  distribu- 
tion of  BMR  fluctuations,  regardless  of  the  difference  in  their  phylogeny,  physiology 
and  body  size.  The  distribution  has  the  form  Eq.  (26)  with  the  independent  vari- 
able given  by  the  fluctuation  in  the  BMR  AB  and  P  =  V^/ SDbmr'',  in  terms  of  the 
scaling  variable  B^c  =  \/2A5/ SDbmr  curves  for  12  different  species  collapse  onto 
a  single  universal  curve. 


3.2.3.  Paleobiology  and  scaling 

Phenomena  with  intermittent  properties  described  by  inverse  power-law  statistics 
are  apprarently  ubiquitous^^^  and  paleobiology  is  no  exception.  Bak  and  Boettcher^ 
interpreted  Charles  Lyell's^^^  uniformitarianism  as  meaning  that  all  geologic  activ- 
ity should  be  explainable  in  terms  of  readily  available  processes  working  at  all  times 
and  all  places  with  the  same  intensity.  They  go  on  to  argue  that  the  existence  of 
earthquakes,  volcanic  eruptions,  floods  and  tsunamis  all  indicate  that  the  physical 
world  is  not  in  equilibrium.  Moreover  the  intermittent  nature  of  the  paleontological 
record  indicates  that  macroevolution  is  also  out  of  equilibrium  and  consequently 
the  inverse  power-law  statistics  are  possibley  suitable  for  their  description. 

Eldredge  and  Gould^^  argued  that  punctuated  change  dominates  the  history  of 
life  and  that  relatively  rapid  episodes  of  speciation  constitute  biological  macroevo- 
lution. The  intermittency  of  speciation  in  time  has  been  explained  by  one  group 
as  punctuated  equilibria^^  and  has  been  indirectly  related  to  fractal  statistics  by 
identifying  it  as  a  self-organized  critical  phenomenon.^  In  the  self-organized  criti- 
cality  model  of  speciation  Bak  and  Boettcher^  associate  an  avalanche  of  activity 
with  exceeding  a  threshold  and  the  distribution  of  returns  to  the  threshold  with  a 
"devil's  staircase"  having  a  distribution  of  steps  of  stasis  of  lengths  given  by  the 
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inverse  power-law  pdf^  that  is,  with  7  =  1.75.  Moreover,  as  explained  by  Snep- 
pen  et  al.,'^^^  the  number  of  genera  TV,  with  a  lifetime  T  can  be  fitted  very  well 
to  a  power  law  with  /3  ^  2}^^  More  recently  Rikvold  and  Zia^^^  put  forward 
an  explanation  of  punctuated  equilibrium  based  on  1  //  noise  in  macroevolutionary 
dynamics  that  also  yields  an  inverse  power-law  pdf  for  the  life  time  of  ecological 
communities  with  (3  =  2. 

Sole  et  al?^^  analyzed  the  statistics  of  the  extinction  fossil  record  (time  series) 
and  determined  that  the  power  spectrum  has  the  form: 

5(/)cxl//^ 

with  0  <  /i  <  2.  They  find  0.80  <  ji  <  0.90  and  argue  that  these  values  support 
the  self-organized  criticality  interpretation  of  extinction.  On  the  other  hand.  Plot- 
nick  and  Sepkowski^^^  also  find  a  1//  power  spectrum  with  a  power-law  index  for 
extinction  consistent  with  Sole  et  al?'^^  and  indices  for  species  generation  of  ap- 
proximately half  that  for  extinction.  However  the  latter  authors  conclude  that  their 
results  are  incompatible  with  self-organized  criticality  and  instead  are  compatible 
with  multifractal  self-similarity  in  both  the  extinction  and  generation  records. 


4.  Models  of  Allometry 

Two  distinct  methods  dominate  the  many  derivations  of  AR.  One  method  is  based 
on  the  first-principles  reductionistic  approach  starting  from  an  assumed  form  for  the 
underlying  mechanisms  and  from  that  deducing  the  necessity  of  Eq.  (2).  The  other 
method  is  phenomenological  involving  statistical  analysis  and  identifying  patterns 
in  the  data  analysis  and  from  these  patterns  deducing  the  necessity  of  the  empirical 
AR.  A  number  of  the  statistical  methods  were  reviewed  in  the  previous  section.  The 
ARs  stand  out  as  empirical  relations  that  have  withstood  the  test  of  time,  whereas 
the  same  cannot  be  said  for  the  models  developed  to  explain  how  they  come  about. 
In  this  section  we  examine  attempts  to  formulate  general  principles  from  which  the 
underlying  mechanisms,  whether  reductionist  or  statistical,  producing  the  ARs  can 
be  identified. 

The  search  for  an  unifying  principle  parallels  such  historical  activities  as  the 
determination  of  "least  action"  in  analytic  mechanics  or  optimization  in  control 
theory.  More  recently  investigators  have  rediscovered  Newton's  "principle  of  simil- 
itude'" introduced  in  the  Principia  (II,  Proposition  32).  Scaling  and  the  princi- 
ple of  similitude  have  been  present  in  the  study  of  complex  physical  phenomena 
since  physics  became  a  science.  In  modern  times  it  is  RG  theory  that  provides  a 
formalism  for  determining  how  force  is  transferred  across  multiple  scale. ^^^'^^^'^^^ 
Part  of  the  reason  for  exploring  this  approach  is  that  fractal  geometry  and  frac- 
tal statistics  are  able  to  capture  some  of  the  regularity  observed  in  vast  amounts 
of  data  in  the  life  and  social  sciences  in  addition  to  the  physical  sciences.  The 
implementation  of  fractal  geometry  and  RG  theory  to  study  the  architecture  of 
physiological  forms, ^^'^^^'^^^  interacting  networks  of  chemical  reactions^^'^^'^^^  and 
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the  topology  of  ecological  webs^''  over  the  past  quarter  century  has  lead  to  some 
remarkable  insights.  In  particular  the  descriptive  success  of  investigations  using  frac- 
tals^^'^^^'^^^'^^^  suggests  that  branching  networks  with  fractal  architectures  have  an 
evolutionary  advantage^^^'^^^'^^^  as  do  fractal  stochastic  processes. 

4.1.  Optimization  principles 

Optimization  provides  one  of  the  best  recipes  for  determining  network  dynamics 
consistent  with  a  given  set  of  constraints.  It  is  through  the  specification  of  the 
constraints  that  insight  into  a  phenomenon  is  introduced.  The  biological  sciences 
employ  ideas  such  as  energy  minimization;  the  minimal  use  of  materials  along  with 
the  maximization  of  efficiency.  So  whether  the  extremum  is  a  minimum  or  maximum 
depends  on  one's  purpose  and  the  quantity  being  varied. 

4.1.1.  Energy  minimization 

Scaling  relations  in  living  networks  result  from  the  balancing  of  various  constraints. 
One  such  biological  balancing  is  that  of  energy  utilization  against  the  energy  cost  of 
carrying  out  a  biological  function.  A  technique  that  has  been  used  both  implicitly 
and  explicitly  in  the  derivation  of  ARs  is  supplying  nutrients  to  various  parts  of 
an  organism  through  the  venous  and  capillary  networks  as  well  as  through  the 
respiratory  network.  Murray^^^  considered  a  fluid  with  viscosity  u  and  laminar  flow 
Q  within  a  tube  of  length  /  and  radius  r.  The  flow  had  to  overcome  the  vascular 
resistance  that  generates  a  pressure  difference  along  the  length  of  the  tube  given 
by  Poiseuille's  law  Ap  =  SlvQ/irr^. 

A  constraint  on  the  flow  is  the  cost  of  transporting  fluid  of  cylindrical  volume 
V  =  7r/r^  along  the  tube  so  that  introducing  c  as  the  cost  factor  the  total  work  to 
be  done  per  unit  time  is  given  by: 

E  =  QAp  ^cV=  +  CTTlr^  .  (30) 

Consequently,  minimizing  this  expression  with  respect  to  the  radius  yields  the  opti- 
mal flow:  Q  =  Cr^  with  the  constant  C  =  ^/c7T^/16u.  The  cubic  dependence  of  the 
flow  rate  on  radius  is  known  as  Murray's  law^^^  and  is  indicative  of  the  maximum 
efficiency  of  the  flow. 

Murray^^^  subsequently  extended  his  result  to  bifurcating  networks  such  as 
occurs  in  bronchial  airways.  The  flow  from  a  parent  vessel  of  radius  tq  branches 
into  two  daughter  vessels  ri  and  r2  such  that  the  flow  divides: 

Qo  =  Qi  +  Q2  . 

Therefore  inserting  Murray's  law  into  this  expression  yields  da  Vinci's  equation  with 
a  =  3  and  in  the  case  of  equal  radii  in  the  daughter  branches  ri  =  r2  we  obtain 
the  scaling  relation  ri  =  2~^/^ro.  Thus,  the  maximally  efficient  bifurcating  network 
in  terms  of  energy  transport  cost  has  radii  decreasing  as  the  cube  root  of  two.  As 
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Fig.  6.  Sketch  of  a  branching  structure  such  as  a  blood  vessel  or  bronchial  airway  with  the 
parameters  used  in  a  bifrucating  network  model. 

pointed  out  by  Weibel^^^  this  last  result  is  also  known  as  Murray's  law.  However 
this  law  was  actually  first  formulated  by  Hess^^  for  blood  vessels  and  subsequently 
is  more  properly  called  the  Hess-Murray  law. 

4.1.2.  Optimal  design 

Rashevsky^ ''^  introduced  the  principle  of  optimal  design  in  which  the  material  used 
and  the  energy  expended  to  achieve  a  prescribed  function  is  minimal.  He  applied  the 
principle  to  the  basic  problem  of  how  the  arterial  network  could  branch  in  space 
in  order  to  supply  blood  to  every  element  of  tissue.  To  address  this  problem  he 
used  the  model  of  a  bifurcating  branching  network  supplying  blood  to  a  restricted 
volume  and  reducing  the  total  resistance  to  the  flow  of  blood.  His  purpose  was  to 
determine  the  condition  imposed  by  the  requirement  that  the  total  resistance  is 
minimum. 

Here  we  assume  the  branching  network  is  composed  of  N  generations  from  the 
point  of  entry  (0)  to  the  terminal  branches  {N).  A  typical  tube  at  some  intermediate 
generation  k  has  length  //c,  radius  Vk  and  pressure  drop  across  the  length  of  the 
branch  Ap^  as  sketched  in  Fig.  6.  The  volume  flow  rate  Q/c  is  expressed  in  terms 
of  the  flow  velocity  averaged  over  the  cross  sectional  area  Uk  :  Qk  =  Trr^iifc.  Each 
tube  branches  into  n  smaller  tubes  with  the  branching  of  the  vessel  occurring  over 
some  distance  that  is  substantially  smaller  than  the  lengths  of  the  tubes  of  either 
generation.  Consequently,  the  total  number  of  branches  generated  up  to  generation 
k  is  Nk  =  n^.  The  pressure  difference  at  generation  k  between  the  ends  of  a  tube 
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is  given  by  a  suitably  indexed  version  of  Poiseuille's  law  and  the  total  resistance  to 
the  flow  is  given  by  the  ratio  of  the  pressure  to  flow  rate 

Apfc  _  Sulk 
Qk  Trrl 

The  total  resistance  for  a  network  branch  with  m  identical  tubes  in  parallel  is 
1/m  the  resistance  of  each  individual  tube.  Thus,  in  this  oversimplifled  case  we  can 
write  the  total  network  resistance  as: 

=  ^  +  (32) 

In  order  to  minimize  the  resistance  for  a  given  mass  Rachevsky  flrst  expressed  the 
initial  radius  tq  in  terms  of  the  total  mass  of  the  network.  The  optimum  radii 
for  the  different  branches  of  the  bifurcation  network  having  the  total  mass  M  are 
then  determined  such  that  the  total  resistance  is  a  minimum  dQr  / drj  =  0  yielding 

—  1/3 

the  equality  rk  =  vq.  The  ratio  of  the  radii  between  successive  generations 

is  r/e+i/r/e  =  (A/'/e/7V/e+i)"^/^so  that  inserting  the  number  of  branches  at  the  /cth 
generation  Nk  =  yields: 

rfe+iK  =n-l/^  (33) 

yielding  an  exponential  reduction  in  the  branch  radii  across  generations.  Note  the 
formal  similarity  of  this  ratio  to  Horton's  law  in  which  the  ratio  of  numbers  of  river 
branches  is  independent  of  generation  number. 

Rashevsky  considered  the  bifurcating  case  n  =  2  where  the  ratio  of  radii  reduces 
to  Vk+i/vk  =  2~^/^  =  0.794.  This  is  the  classic  "cube  law"  branching  of  Thomp- 
gQj^2i3  which  he  used  the  "principle  of  similitude".  The  value  2~^/^  was  obtained 
by  Weibel  and  Gomez^^^  for  the  reduction  in  the  diameter  of  bronchial  airways  for 
the  flrst  ten  generations  of  the  bronchial  tree.  However  they  noted  a  sharp  devia- 
tion away  from  this  constant  fractional  reduction  after  the  tenth  generation.  The 
value  2~^/^  was  also  obtained  by  Wilson^^^  who  explained  the  proposed  exponen- 
tial decrease  in  the  average  radius  of  a  bronchial  tube  with  generation  number  by 
showing  that  this  is  the  functional  form  for  which  a  gas  of  given  composition  can 
be  provided  to  the  alveoli  with  minimum  metabolism  or  entropy  production  in  the 
respiratory  musculature.  He  proposed  minimum  entropy  production^^  as  the  design 
principle  for  biological  networks  to  carry  out  a  given  function. 

The  deviation  from  classical  (exponential)  scaling  above  generation  ten  shown  in 
Fig.  7  was  eventually  explained  using  an  alternative  model  of  the  bronchial  airways 
in  terms  of  fractal  statistics^^^  as  we  subsequently  discuss. 

4.2.  Metabolic  allometry 

Barenblatt  and  Monin^^  proposed  that  metabolic  scaling  might  be  a  consequence 
of  the  fractal  nature  of  biology,  but  they  did  not  provide  a  mechanistic  model  for  its 
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Generation  {i) 


Fig.  7.  As  the  bronchial  tree  branches  out,  its  tubes  on  average  decrease  in  size.  A  theory  con- 
sistent with  the  Principle  of  Similitude-^-*^^  predicts  that  their  diameters  should  decrease  by  about 
the  same  ratio  from  one  generation  to  the  next;  exponential  decline.  This  semilog  graph  shows 
measurements  from  Weibel  and  Gomez-^-^-^  for  23  generations  of  bronchial  tubes  in  the  human 
lung.  The  prediction  is  a  straight  line  that  fits  the  anatomic  data  (dots)  until  about  the  tenth 
generation,  after  which  they  deviate  systematically  from  an  exponential  decline,  [adapted  from 
West  and  Goldberger.^^^] 

description.  This  shortcoming  has  been  overcome  by  a  number  of  investigators  who 
have  devised  numerous  fractal  models  to  describe  AR  in  a  variety  of  contexts. ^^^'^^^ 

4.2.1.  Elastic  similarity  model 

The  first  model  to  analytically  predict  the  allometry  exponent  3/4  was  constructed 
by  McMahon.^^^'^^^  His  argument  rests  on  the  observation  that  the  weight  of  a  col- 
umn increases  more  rapidly  with  size  than  does  its  strength.  Moreover  as  discussed 
by  Schmidt-Nielsen^^^  if  the  column  is  tall  and  slender  it  can  fail  due  to  elastic 
buckling  in  which  small  lateral  displacements  exceed  the  elastic  restoring  forces. 
For  a  sufficiently  slender  column  with  Young's  elastic  modulus  E  and  density  p  the 
critical  length  of  a  column  of  diameter  d  is: 

=  k{E/py^'-<f/^  (34) 

and  /c  is  a  constant.  The  elastic  criteria  of  McMahon  is  therefore  given  by  l^  o:  d^ . 
The  weight  of  the  column  is  given  by  the  product  of  the  density,  length  and  cross- 
sectional  area, 

Mg  =  pd^l'^nd^l^, 

where  the  length  has  been  replaced  using  the  elastic  criteria  yielding  d  oc  M^/^. 
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The  implications  for  allometry  of  the  scaling  between  cylinder  diameter  and 
mass  was  discussed  by  Calder^^  using  the  symmorphosis  hypothesis  of  Taylor  and 
Weibel:^^^  . .  is  regulated  to  satisfy  but  not  exceed  the  requirements  of  the  func- 
tional system". First  off  we  recognize  that  locomotion  requires  the  contraction 
of  muscles.  During  contraction,  muscles  exert  a  force  that  increases  with  the  cross- 
sectional  area  of  the  muscle.  The  power  output  of  the  muscle  is  the  work  done  per 
unit  time  and  may  be  equated  with  the  metabolic  rate,  the  product  of  the  force 
generated  by  the  muscle  and  the  velocity  u  of  the  shortening  of  the  muscle  B  cc  cPu. 
The  velocity  of  the  shortening  of  the  muscle  appears  to  be  a  size-independent  con- 
stant from  species  to  species^^  so  that  using  these  two  equations  for  the  diameter 
yields: 

B  oc  M^^^  . 

Consequently,  the  allometry  exponent  3/4  in  McMahon's  elastic  similarity  model 
is  required  to  maintain  the  flow  of  energy  to  the  working  muscles  and  is  consistent 
with  the  principle  of  symmorphosis. 

Versions  of  the  above  argument  given  by  both  Calder^^  and  Schmidt-Nielsen^^^ 
seem  to  explain  the  value  of  the  allometry  exponent  for  warm  blooded  animals. 
Dodds  et  al^^  critique  McMahon's  model  by  noting  that  there  is  no  compelling 
reason  why  the  power  output  of  muscles  should  be  the  dominant  factor  in  the  scaling 
of  BMR.  Moreover,  Savage  et  al}^^  point  out  that  while  the  elastic  similarity  model 
might  apply  to  the  bones  of  mammals  or  the  trunks  of  trees  that  have  adapted  to 
gravitational  forces,  it  is  doubtful  that  it  is  applicable  to  aquatic  or  unicellular 
organisms  that  also  display  an  allometry  exponent  of  3/4.^'' 

4.2.2.  WEE  model 

West,  Brown  and  Enquist^^''  (WEE)  published  a  quantitative  model  of  metabolic 
AR  that  has  had  significant  impact  on  how  a  significant  fraction  of  today's  bi- 
ology/ecology community  understands  metabolic  ARs.  WBE  model  nutrient  dis- 
tribution within  a  hierarchal  network  in  which  vessels  become  narrower,  shorter 
and  more  numerous  between  successive  levels  proceeding  from  the  initial  to  the 
terminal  level  reminiscent  of  representations  of  river  branchings.  The  scaling  in 
the  transport  network  is  a  consequence  of  the  constraints  imposed  by  three  as- 
sumptions: (1)  The  entire  volume  of  the  organism  is  crammed  with  a  space-filling 
branching  network.  (2)  The  tube  properties  at  the  terminus  of  the  network  are 
size- invariant.  (3)  The  energy  required  to  distribute  resources  using  this  network 
is  minimal,  that  is,  the  hydrodynamic  resistance  of  the  network  is  minimized.  We 
note  their  claim  that  this  model  is  the  origin  of  universal  scaling  laws  in  biol- 
ogy^^^'^^^  with  b  =  3/4.  However  we  recall  at  the  start  that  the  existence  of  an 
empirical  exponent  b  =  3/4  for  metabolic  AR  has  been  questioned  by  numer- 
ous investigators^^'^^'^^^'^^^'^^^  and  we  address  these  concerns  and  others  in  due 
course. 
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WBE  introduce  two  parameters  to  characterize  the  network  branching  process, 
13k  =  r/e+i/r/e  the  ratio  of  successive  branch  radii,  as  was  done  in  the  energy  mini- 
mization arguments^ ''^  and  the  other  is  =  h+i/h^  the  ratio  of  successive  branch 
lengths.  They  conclude  that  the  total  number  of  terminal  branches  scales  with  TBM 
as  Nt  =  {M/Mq)^.  The  total  number  of  branches  at  level  k  is  =  .  Conse- 
quently at  the  network  terminus  the  self-similarity  of  the  network  yields  Nt  = 
and  consequently  the  total  number  of  branches  is: 

N  =  61n(M/Mo)/lnn.  (35) 

Rashevsky's  energy  minimization  argument  indicates  that  the  transport  of  nutrients 
in  complex  networks  is  maximally  efficient  when  j3k  is  independent  of  k.  This  is  the 
"fractal"  scaling  assumption  made  by  WBE. 

The  estimates  of  the  ratio  parameters  required  two  separate  assumptions.  To 
estimate  the  ratio  of  lengths  WBE  assume  that  the  volume  of  a  tube  at  generation  k 
can  be  replaced  by  a  spherical  volume  of  diameter  Ik  and  in  this  way  implement  the 
space-filling  assumption.  The  conservation  of  volume  between  generations  therefore 
leads  to  the  space  filling  condition: 

Hk+i/lkf  ^Nk/Nk+i  =n-^ 
and  consequently  to  the  /^-independent  parameter 

7  =  7fe  =  ' 

They  maintain  that  this  level-independent  scaling  of  the  lengths  is  a  generic  prop- 
erty of  all  the  space-filling  networks  they  consider.  This  condition  for  an  n-branching 
network  with  a  reduction  in  size  of  n~^/^  between  successive  generations  yields  us- 
ing the  definition  for  the  fractal  dimension  D  =  log  n/ log  n^/^  =  3. 

A  separate  and  distinct  assumption  is  made  to  estimate  P  using  the  classic 
rigid-pipe  model  to  equate  the  cross-sectional  areas  between  successive  generations: 
Trr'j  =  n7rr|^2  5      that  we  have 

l3  =  /3k  =  . 

Note  that  this  scaling  of  /3  differs  from  the  ratio  parameter  obtained  using  energy 
minimization.  Inserting  these  values  of  the  scaling  parameters  into  the  expression 
for  b  yields  the  sought  after  exponent  b  =  3/4.  This  value  of  the  exponent  in  turn 
determines  a  number  of  other  quarter-power  scaling  laws. 

WBE  point  out  that  this  is  strictly  a  geometrical  argument  applying  only  to 
those  networks  that  exhibit  area-preserving  branching.  Moreover  the  fluid  velocity 
is  constant  throughout  the  network  and  it  is  independent  of  size.  They  go  on  to 
say  that  these  features  are  a  natural  consequence  of  the  idealized  vessel-bundle 
structure  of  plant  vascular  networks  in  which  area-preserving  arises  automatically 
because  each  branch  is  assumed  to  be  a  bundle  of  n^~^  elementary  vessels  of  the 
same  radius.  They  recognized  that  this  is  not  the  situation  with  vascular  blood 
flow  where  the  beating  of  the  heart  produces  a  pulsating  flow  that  generates  a  very 
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different  kind  of  scaling.  Area-preserving  is  also  not  true  in  the  mammalian  lung 
where  there  is  a  distribution  of  radii  at  each  level  of  branching. 

A  physical  property  that  the  area  preserving  condition  violates  is  that  blood 
slows  down  in  going  from  the  aorta  to  the  capillary  bed.  Here  WBE  return  to  the 
principle  of  energy  minimization  and  as  stated  by  West^^^  assert  that  to  sustain  a 
given  metabolic  rate  in  an  organism  of  fixed  mass,  with  a  given  volume  of  blood, 
the  cardiac  output  is  minimized  subject  to  a  space-filling  geometry.  This  variation 
is  essentially  equivalent  to  minimizing  the  total  impedance  since  the  fiow  rate  is 
constant  and  again  yields  the  Hess-Murray  law  f3  =  n~^^^  corresponding  to  area- 
increasing  branching.^' This  change  in  scaling  from  the  area-preserving  n~^l^ 
to  the  area- increasing  n~^/^  solves  the  problem  of  slowing  down  blood  fiow  to 
accommodate  diffusion  at  the  capillary  level.  Moreover,  the  variation  also  leads  to 
an  allometry  exponent  6=1.  Such  an  isometric  scaling  (6  =  1)  suggests  that  plants 
and  animals  follow  different  allometry  scaling  relations  as  was  found. ^^^'^^^ 

A  detailed  treatment  of  pulsate  fiow  is  not  straightforward  and  will  not  be 
presented  here,  but  see  Savage  et  al}^^;  Silva  et  al.'^^^  and  Apol  et  al.^  for  details 
and  commentary  in  the  context  of  the  WBE  model.  We  do  note  that  for  blood  fiow 
the  walls  of  the  tubes  are  elastic  and  consequently  the  impedance  is  complex,  as  is 
the  dispersion  relation  that  determines  the  velocity  of  the  wave  and  its  frequency. 
Consequently  pulsate  fiow  is  attenuated^^'^^  and  WBE  argue  that  the  impedance 
changes  its  r-dependence  from  for  large  tubes  to  for  small  tubes.  The 
variation  therefore  changes  from  area-preserving  fiow  (3  =  n~^/^  for  large  vessels  to 
dissipative  fiow  (3  =  n~^/^  for  small  vessels  where  blood  fiow  is  forced  to  slow.  Thus 
Pk  is  /c-dependent  in  the  WBE  model  for  pulsate  fiow  and  at  an  intermediate  value  of 
k  the  scaling  changes  and  this  changeover  value  is  species  dependent.  These  results 
are  contradicted  in  the  more  extensive  analysis  of  pulsate  fiow  by  Apol  et  al.^^  who 
conclude  that  Kleiber's  law  remains  theoretically  unexplained. 

Kozlowski  and  Konarzewski^^^  critique  the  apparent  limitations  of  the  WBE 
model  assumptions.  The  size-invariance  assumption  has  been  interpreted  by  them 
to  mean  that  Nt  oc  M,  that  is,  the  terminal  number  of  vessels  scales  isometrically 
with  size  and  consequently  causes  the  number  of  levels  to  be  a  function  of  body 
size  since  more  levels  are  required  to  fill  a  larger  volume  with  the  same  density  of 
final  vessels.  However  Brown  et  al?^  assert  that  NtVt  oc  M  so  that 

Nt  oc  M^^^  (36) 

to  which  Kozlowski  and  Konarzewski^^^  respond  that  such  mass  dependence  is 
an  arbitrary  assumption  and  is  not  proven,  see  also  Dawson. Etienne  et  al.^^ 
reconstruct  the  WBE  model  without  making  the  self-similarity  assumption  and  in 
so  doing  satisfy  the  concerns  of  Kozlowski  and  Konarzewske.^^^'^^^  The  arguments 
remain  unresolved  and  Cyr  and  Walker^^  refer  to  the  assumption  embodied  in 
Eq.  (36)  as  the  illusion  of  mechanistic  understanding  and  maintain  that  after  a 
century  of  work  the  jury  is  still  out  on  the  magnitude  of  the  allometry  exponents. 
Riisgard^^^  argues  that  respiration  and  growth  are  integrated  through  the  energetic 
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costs  of  growth  and  that  this  explains  why  the  h  value  is  not  a  "natural  constant" 
and  that  a  "3/4  power  scaling  law  cannot  be  deduced  from  the  interplay  between 
pure  physical  and  geometric  constraints  of  the  transport  of  oxygen. 

A  quite  different  critique  comes  from  Savage  et  al}^^  who  emphasize  that  the 
WBE  model  is  only  valid  in  the  limit  TV  ^  oo,  that  is,  for  infinite  network  size  (body 
mass)  and  that  the  actual  allometry  exponent  predicted  depends  on  the  sizes  of  the 
organisms  considered.  They  calculate  that  the  AR  between  BMR  and  TBM  have 
corrections  for  finite  TV  given  by: 


from  which  it  is  clear  that  6  =  3/4  only  occurs  when  a2B^^^  ^  ai.  This  inequal- 
ity is  not  satisfied  for  bodies  of  finite  size.  In  their  original  publication  WBE  ac- 
knowledged the  potential  importance  of  such  finite-size  effects,  especially  for  small 
animals,  but  the  magnitude  of  the  effect  remained  unspecified.  Using  explicit  ex- 
pressions for  the  coefficients  in  the  WBE  model  Savage  et  al}^^  show  that  when 
accounting  for  these  corrections  over  a  size  range  spanning  the  eight  orders  of  mag- 
nitude observed  in  mammals  a  scaling  exponent  of  6  =  0.81  is  obtained.  Moreover  in 
addition  to  this  strong  deviation  from  the  desired  value  of  3/4  there  is  a  curvilinear 
relation  between  the  TBM  and  the  BMR  in  the  WBE  model  given  by: 


whose  curvature  is  opposite  to  that  observed  in  the  data.  Consequently  they  con- 
clude that  the  WBE  model  needs  to  be  amended  and/or  the  data  analysis  needs 
reassessment  to  resolve  this  discrepancy.  A  start  in  this  direction  has  been  made 
by  Kolokotrones  et  al}^^  Agutter  and  Tuszynski^  also  review  the  evidence  that  the 
fractal  network  theory  for  the  two- variable  AR  is  invalid. 

Another  variation  on  this  theme  was  made  by  Price  et  al}^^  who  relax  the 
fractal  scaling  assumptions  of  WBE  and  show  that  allometry  exponents  are  highly 
constrained  and  covary  according  to  specific  quantitative  functions.  Their  results 
emphasize  the  importance  of  network  geometry  in  determining  the  allometry  expo- 
nents and  supports  the  hypothesis  that  natural  selection  minimizes  hydrodynamic 
resistance.  Moreover  they  extended  McMahon's  elastic  similarity  model  and  ap- 
ply it  to  plant  scaling  exponents  showing  its  consistency  with  their  modification  of 


Prior  to  WBE  there  was  no  unified  theoretical  explanation  of  quarter-power 
scaling.  Banavar  et  al}'^  show  that  the  3/4  exponent  emerges  naturally  as  an  upper 
bound  for  the  scaling  of  metabolic  rate  in  the  radial  explosion  network  and  in 
the  hierarchical  branching  networks  models  and  they  point  out  that  quarter-power 
scaling  can  arise  even  when  the  underlying  network  is  not  fractal. 

Finally,  Weibel^^^  presents  a  simple  and  compelling  argument  on  the  limitations 
of  the  WBE  model  in  terms  of  transitioning  from  BMR  to  the  maximal  metabolic 
rate  (MMR)  induced  by  exercise.  The  AR  for  MMR  has  an  exponent  h  =  0.86 


M  =  aiB  ^  a2B^^^ 


(37) 


WBE. 
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rather  than  3/4,  so  that  a  different  approach  to  determining  the  exponent  is  needed. 
Painter^^^  demonstrates  that  the  empirical  ahometry  exponent  for  MMR  can  be 
obtained  in  the  manner  pioneered  by  WBE  by  using  the  Hess-Murray  law  for  the 
scaling  of  branch  sizes  between  levels. 

Weibel^^^  argues  that  a  single  cause  for  the  power  function  arising  from  a  fractal 
network  is  not  as  reasonable  as  a  model  involving  multiple  causes,  see  also  Agut- 
ter  and  Tuszynski.^  Darveau  et  al.^^  propose  such  a  model  recognizing  that  the 
metabolic  rate  is  a  complex  property  resulting  from  a  combination  of  functions. 
West  et  al?^^  and  Banavar  et  al}^  demonstrate  that  the  mathematics  in  the  dis- 
tributed control  model  of  Darveau  et  al^^  is  fundamentally  flawed.  In  their  reply 
Darveau  et  al.'^'^  do  not  contest  the  mathematical  criticism  and  instead  point  out 
consistency  of  the  multiple-cause  model  of  metabolic  scaling  with  what  is  known 
from  biochemical^^^  and  physiological^^^  analysis  of  metabolic  control.  The  notion 
of  distributed  control  remains  an  attractive  alternative  to  the  single  cause  mod- 
els of  metabolic  AR.  A  mathematically  rigorous  development  of  AR  with  fractal 
responses  from  multiple  causes  was  recently  given  by  Vlad  et  a/.^^^  in  a  general 
context.  This  latter  approach  may  answer  the  formal  questions  posed  by  many  of 
these  critics. 


4.3.  Why  fractal  transport? 

Why  are  fractals  important  in  the  design  of  allometry  networks?  Barenblatt  and 
Monin^^  suggested  that  metabolic  scaling  might  be  a  consequence  of  the  fractal  na- 
ture of  biology  and  WBE  determined  that  fractal  geometry  maximizes  the  efficiency 
of  nutrient  transport  in  biological  networks.  Weibel^^^  maintains  that  the  fractal 
design  principle  can  be  observed  in  all  manner  of  physiologic  networks  quantifying 
the  observations  and  speculations  of  Mandelbrot, ^^^'^^^  as  does  West.^^^ 

West^^^  conjectured  that  fractals  are  more  adaptive  to  internal  changes  and  to 
changes  in  the  environment  than  are  classical  processes  and  structures.  Consider  a 
network  property  characterized  by  classical  scaling  at  the  level  k  such  as  the  length 
or  diameter  of  a  branch  oc  e~^^  compared  with  a  fractal  scaling  characterization 
of  the  same  property  oc  k~^.  What  is  significant  in  these  two  functional  forms  for 
the  present  argument  is  the  dependence  on  the  parameter  A.  The  exponential  has 
emerged  from  a  large  number  of  optimization  arguments  and  the  inverse  power-law 
results  from  RG  scaling  arguments. 

Assume  the  parameter  A  is  the  sum  of  a  constant  part  Aq  and  a  random  part  ^. 
The  random  part  can  arise  from  unpredictable  changes  in  the  environment  during 
morphogenesis,  non-systematic  errors  in  the  code  generating  the  physiologic  struc- 
ture or  any  of  a  number  of  other  causes  of  irregularity.  Thus,  regardless  of  whether 
the  errors  are  induced  internally  or  externally,  the  average  is  taken  over  an  ensemble 
of  zero-centered  Gaussian  fluctuations  ^  with  variance  (j^/2.  Note  that  the  choice 
of  Gauss  statistics  has  no  special  significance  here  except  to  provide  closed  form 
expressions  for  the  averages  to  facilitate  discussion.  The  relative  error  generated  by 
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Fig.  8.  The  error  between  the  model  prediction  and  the  prediction  averaged  over  a  noisy  param- 
eter is  shown  for  the  classical  model  (upper  curve)  and  the  fractal  model  (lower  curve). 

the  fluctuations  is  given  by  the  ratio  of  the  average  value  to  the  function  in  the  ab- 
sence of  fluctuations,  yielding  the  relative  error  for  classical  scahng  £k  =  exp[cr^/c^] 
and  for  fractal  scaling  Sk  =  exp[cr^(ln /c)^]. 

The  two  error  functions  are  graphed  in  Fig.  8  for  fluctuations  with  a  variance 
cr^  =  0.01.  At  /c  =  15  the  error  in  classical  scaling  is  9.5.  This  enormous  relative  error 
means  that  the  perturbed  average  property  at  generation  15  differs  by  nearly  an 
order  of  magnitude  from  what  it  would  be  in  an  unperturbed  network.  A  biological 
network  with  this  sensitivity  to  error  would  not  survive  for  very  long  in  the  wild.  For 
example,  the  diameter  of  a  bronchial  airway  in  the  human  lung  could  not  survive 
this  level  of  sensitivity.  However,  the  property  of  the  fractal  network  only  changes 
by  10%  at  the  distal  point  k  =  20.  The  implication  is  that  the  fractal  network  is 
relatively  unresponsive  to  fluctuations. 

A  fractal  network  is  consequently  very  tolerant  of  variability.  This  error  tolerance 
can  be  traced  back  to  the  broadband  nature  of  the  distribution  in  scale  sizes  of  a 
fractal  object.  This  distribution  ascribes  many  scales  to  each  generation  in  the 
network.  The  scales  introduced  by  the  errors  are  therefore  already  present  in  a 
fractal  object.  Thus,  the  fractal  network  is  preadapted  to  variation  and  is  therefore 
insensitive  to  change. ^^^'^^^  These  conclusions  do  not  vary  with  modification  in  the 
assumed  statistics  of  the  errors.  Therefore  let  us  review  how  fractal  statistics  have 
been  employed  in  the  understanding  of  some  nonmetabolic  physiologic  ARs. 

4.3.1.  WBG  bronchial  tree 

The  energy  minimization  argument  applied  to  a  branching  network  resulted  in 
Horton's  law.  However,  the  theoretical  arguments  justifying  this  empirical  law  and 
others  like  it  assume  that  the  network  is  geometrically  self-similar.  Real  networks, 
such  as  the  bronchial  tree  in  the  mammalian  lung,  do  not  have  such  determinis- 
tic regularity.  At  any  generation  of  the  bronchial  airway  there  is  a  distribution  of 
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lengths  and  diameters,  with  a  reduction  in  the  average  length  and  average  diam- 
eter between  successive  generations.  Consequently,  the  simple  characteristic  scale 
governing  the  decrease  in  bronchial  dimensions  across  generations  given  by  the 
Hess-Murray  law  must  be  reexamined.  West,  Barghava  and  Goldberger^^^  (WBG) 
assumed  the  mammalian  lung  to  be  a  fractal  structure  having  a  distribution  of 
scales  contributing  to  the  variability  in  tube  size  at  each  generation.  This  model 
yields  an  average  bronchial  diameter  that  decreases  with  generation  number,  not 
as  an  exponential  as  suggested  by  Weibel  and  Gomez, but  as  an  inverse  power 

law.229^232,233 

WBG  assume  that  the  variability  in  the  diameters  of  the  bronchial  tubes  at 
generation  k  is  given  by  d{jk)^  where  7  is  a  random  variable  that  determines  the 
size  of  the  diameter.  In  the  energy  minimization  argument  we  determined  that 
7  =  In  2/3  and  the  classical  reduction  in  diameter  with  generation  number  was 
exponential  d{jk)  =  doe~^^  as  shown  in  Fig.  7.  However  with  real  data  this  is  the 
reduction  in  the  average  diameter  just  as  it  was  for  other  allometry  networks  so  that 
the  random  variation  in  diameter  at  generation  k  is  smoothed  over  to  determine  the 
average.  Formally  the  behavior  of  the  average  diameter  as  a  function  of  generation 
number  is  given  by: 

d(k)  =  J  d{jk)p{j)dj,  (38) 

with  ^(7)  the  pdf  in  scale  size.  Rather  than  prescribe  a  particular  functional  form  to 
this  pdf  in  the  formal  definition  WBG  use  a  RG  argument  originally  developed  in  an 
economic  context  by  Montroll  and  Shlesinger.^^^  Assuming  the  original  distribution 
of  scales  has  an  average  value  7  the  RG  argument  constructs  a  new  distribution  with 
an  infinite  number  of  new  scales  each  a  factor  of  /3  larger  than  the  preceding  scale 
and  each  occurring  with  a  relative  frequency  (  less  often.  The  distribution  resulting 
from  the  RG  relation  gives  rise  to  an  empirical  average  denoted  by  brackets  in  terms 
of  the  formal  average  Eq.  (38)  as  follows^^^'^^^: 

{d{k))  =  cidm)  +  -  om  ■  (39) 

The  dominant  behavior  for  the  solution  to  the  RG  relation  Eq.  (39)  is  determined 
by  the  singular  part  of  the  solution  to  the  equation,  which  for  simplicity  WBG  write 
with  a  subscript  {d{k))s  =  C{d{Pk))s.  Shlesinger  and  West^^^  note  that  the  solution 
to  this  scaling  equation  that  provides  the  best  fit  to  the  data  can  be  written  as  the 
real  part  of: 

^  A 

{d{k))s=  ^n  =  b^i27rn/\n/3;  (40) 

n—  —  oo 

jin  is  the  complex  fractal  dimension  of  order  n  for  the  branching  process,  with  the 
exponent  h  =  —  lnC/ln/3  >  0  and  the  An  are  fit  by  experimental  data.  Of  course 
there  is  in  addition  to  Eq.  (40)  an  analytic  part  to  the  solution  that  WBG  calculated 
but  which  becomes  negligible  with  increasing  k. 
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Thus,  the  average  diameter  is  an  inverse  power  law  in  the  generation  number 
modulated  by  a  slowly  oscillating  function  just  as  observed  in  the  data  depicted  in 
Fig.  9.  This  fractal  model  of  the  bronchial  airway  provides  an  excellent  fit  to  the 
average  bronchial  tube  diameter  data  in  four  distinct  species:  dogs,  rats,  hamsters 


Fig.  9.  Top:  Reploted  data  from  Fig.  7  on  log-log  scales  for  humans.  Center:  The  harmonic 
variation  in  measurements  for  dogs,  rats  and  hamsters  taken  by  Raabe  et  al.  (1976).  Bottom:  The 
average  diameters  for  the  bronchial  tubes  of  a  hamster  and  RG  (fractal)  model  prediction  using 
Eq.  (40).  [Adapted  from  West  and  Goldberger228]. 
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and  humans.  The  quahty  of  the  fit  is  shown  for  hamsters  in  Fig.  9,  which  restricts 
the  summation  index  in  Eq.  (40)  to  n  =  0,1,-1  and  Ai  =  A-i  yields  a  single 
oscillation  of  the  modulation  function  fit  to  the  data. 

The  form  of  Eq.  (40)  strongly  suggests  that  the  RG  relation  for  the  average  di- 
ameter captures  a  fundamental  property  of  the  structure  of  the  lung  that  is  distinct 
from  classical  scaling.  There  is  an  AR  at  each  level  of  the  bronchial  tree  with  the 
average  diameter  decreasing  as  with  the  level  being  a  measure  of  size.  Moreover 
the  data  show  the  same  type  of  scaling  for  bronchial  tube  lengths  and  consequently 
volume. 


4.3.2.  Distribution  of  bronchial  diameters 

Kitaoka  and  Suki^^''  use  a  related  but  distinct  analysis  of  the  lung  data  depicted 
in  Fig.  9  in  which  they  investigate  the  statistical  variability  of  the  bronchial  airway 
diameters.  Their  analysis  was  based  on  the  assumed  relation  between  the  airflow 
rate  and  the  airway  diameter  d: 

Q  =  Cd'', 

a  relation  discussed  in  terms  of  energy  optimization  by  Murry^^^  yielding  a  =  3. 
The  relation  was  also  considered  in  a  bronchial  airway  context  by  a  number  of  other 
investigators. ^^'^^^'^^^  The  data  of  Raabe  et  al}^^  reveals  that  the  flow  rate  is  a 
random  variable  and  that  the  cumulative  probability  for  the  flow  rate  to  exceed  Q 
is  given  by: 

Pr(>  Q)  cx  1/Q 

with  an  inverse  power-law  index  of  —1  to  three  significant  figures.  Note  that  from 
the  additive  form  of  the  flow  rate  between  successive  generations  and  the  da  Vinci 
relation  between  parent  and  daughter  branches  that  the  pdf  for  the  diameter  of  the 
bronchial  airway  is  determined  to  be: 

p(d)=Cd-("+^    C7=^^2^  (41) 

with  do  the  diameter  of  the  thorax  and  dr  the  diameter  of  the  smallest  bronchial 
airway.  Here  again  the  empirical  distribution  for  the  observable,  the  bronchial  tube 
diameter  rather  than  the  allometry  coefficient,  is  an  inverse  power-law  (Pareto)  pdf. 

Note  that  in  the  previous  subsection  the  average  diameter  was  expressed  as  an 
inverse  power  law  in  the  generation  number.  The  distribution  in  the  size  of  the 
bronchial  airway  Eq.  (41)  does  not  directly  depend  on  generation  number  and  is  an 
inverse  power  law  in  diameter  size,  that  is,  in  millimeters  as  depicted  in  Fig.  10.  The 
data  used  to  construct  Fig.  9  was  also  used  to  determine  the  cumulative  distribution 
of  airway  diameters  and  yields  the  inverse  power-law  distribution  with  a  =  3.1  with 
correlation  coefficient  of  =  0.988.  Note  that  this  value  of  the  exponent  is  slightly 
different  from  the  optimum  as  we  discuss  in  the  next  section.  It  is  also  noteworthy 
that  the  oscillations  around  the  inverse  power  law  given  by  the  straight  line  are  the 
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Fig.  10.  Cumulative  distribution  of  diameters  on  log-log  plot.  The  regression  model  over  the  full 
range  of  diameters  yields  o;  =  3.1  with  =  0.988.  [Reproduced  from  Kitaoka  and  Suki-'^^'^  with 
permission] . 

same  periodic  variation  explained  using  WBG  theory  except  that  the  oscihations 
are  now  a  function  of  distance  rather  than  generation  number. 

4.3.3.  The  optimum  can  he  dangerous 

A  question  of  interest  is  whether  optimal  design  criteria  are  ever  realized  in  nature 
and  whether  they  are  even  desirable.  The  WBE  model  suggests  that  the  allometry 
exponent  value  of  3/4  is  proof  of  the  optimally  of  fractal  design  of  networks  for  nu- 
trient transport.  However  the  controversy  over  the  empirical  value  of  the  allometry 
exponent  calls  this  implication  into  question.  The  present  discussion  regarding  the 
mammalian  lung  and  whether  the  bronchial  tree  is  optimal  has  been  investigated 
by  Mauroy  et  al}^^  They  maintain  that  the  bronchial  tree  of  most  mammalian 
lungs  is  a  good  example  of  an  efficient  distribution  network  with  an  approximate 
fractal  structure. ^^^'^^^  They  state  that  physical  optimization  is  critical  in  that 
small  variations  in  the  geometry  can  induce  large  variations  in  the  net  air  flux  and 
consequently  optimality  cannot  be  a  sufficient  criterion  for  physiologic  design  of  the 
bronchial  tree.  The  slight  deviations  observed  in  the  parameters  presumed  to  be 
optimized  are  a  manifestation  of  a  safety  factor  being  encorporated  into  the  design 
and  into  the  capacity  for  regulating  airway  caliber. 
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In  the  present  context  the  size  ratio  h  of  successive  airway  segments  are  homoth- 
etic  with  h  =  2~-^/^  «  0.79  as  discussed  earher.  Homothetic  scahng  means  that  the 
lengths  and  diameters  have  the  same  ratios  between  successive  generations.  Using 
the  resistance  minimization  argument  for  the  bronchial  network  Mauroy  et  al}^^ 
show  that  the  "best"  bronchial  tree  is  fractal  with  constant  reduction  factor  given 
by  the  Hess-Murray  law.  Do  the  data  support  this  optimal  value  and  if  not  what 
does  that  imply  about  the  efficiency  of  bronchial  airways? 

The  fractal  dimension  for  a  bronchial  airway  is  D  =  —  In  2/  In  so  that  the 
Hess-Murray  law  implies  D  =  3  whereas  h  >  0.79  implies  D  >  3.  In  the  human 
lung  it  is  found  that  the  homothety  ratio  is  h  ^  0.85^^^  and  the  bronchial  network 
is  therefore  not  optimized:  its  volume  is  too  large  and  its  overall  resistance  is  too 
small.  Mauroy  et  al}^^  emphasize  that  this  deviation  from  optimal  is,  in  fact,  a 
safety  margin  for  breathing  with  respect  to  possible  bronchial  constrictions. 

Sapoval^^*"  has  argued  that  without  regulation  of  the  airway  caliber^''^  there 
would  be  a  multifractal  spatial  distribution  of  air  within  the  lungs,  resulting  in 
strongly  nonuniform  ventilation  with  some  regions  of  the  lung  being  poorly  fed 
with  fresh  air.  Expanding  on  this  theme  using  inhomogeneity  of  the  homothety 
ratio  Mauroy  et  al}^^  show  how  the  optimal  network  is  dangerously  sensitive  to 
physiological  variability  and  consequently  design  of  the  bronchial  tree  must  incor- 
porate more  than  just  physical  optimality.  This  argument  has  clear  implications  for 
other  allometric  networks. 


5.  Probability  Calculus  and  ARs 

A  fractal  processes  is  one  that  is  rich  in  scales  with  no  one  scale  being  dominant. 
Thus,  information  in  fractal  phenomena  is  coupled  across  multiple  scales  manifest- 
ing long-time  memory,  as  for  example,  observed  in  the  architecture  of  the  mam- 
malian lung,^^^'^^^'^^^  manifest  in  the  long-range  correlations  in  human  gait^^'^^^ 
and  measured  in  the  human  cardiovascular  network, all  of  which  are  discussed  in 
West.^^^  These  phenomenological  characterizations  of  fractal  time  series  relate  back 
to  the  observation  made  earlier  that  if  X{t)  and  Y{t)  are  stochastic  time  series  then 
Y{X)  must  be  a  random  function  of  its  random  argument  as  well.  Consequently 
the  general  approach  to  determining  the  relation  between  such  variables  is  through 
their  pdfs. 

In  this  section  we  examine  pdfs  that  can  produce  the  interspecies  metabolic 
ARs  as  a  relation  between  moments.  To  do  this  requires  a  brief  review  of  some 
methods  from  nonequilibrium  statistical  physics^ ^^'^^^  and  their  extension  to  frac- 
tional equations  of  motions. ^^^'^^^ 


5.1.  Stochastic  differential  and  Fokkei — Planck  equations 

There  are  two  major  techniques  available  in  statistical  physics  for  the  modeling  of 
stochastic  phenomena.  The  first  technique  uses  the  dynamic  equations  constructed 
by  Langevin^^^  who  introduced  uncertainty  through  a  random  force  in  the  equations 
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of  motion.  The  second  technique  is  based  on  the  phase  space  evolution  of  the  pdf 
using  the  Fokker-Planck  equation  (FPE).  The  conditions  under  which  these  two 
methods  are  equivalent  have  been  shown  in  a  number  of  places,  see,  for  example, 
Lindenberg  and  West.^^^ 


5.1.1.  Additive  fluctuations 

Consider  the  dynamics  of  a  simple  exponential  relaxation  process  Z(t)  that  is  dis- 
rupted by  a  random  force  ^(t): 

dZ(t) 


dt 


-xz{t)^m- 


(42) 


We  assume  the  statistics  of  the  random  force  are  normal  with  a  delta  correlated 
correlation  of  strength  D 


mm)  =  ^mt  -  f)  ■ 


(43) 


An  alternative  description  of  the  dynamics  is  given  by  the  evolution  of  the  prob- 
ability density  in  phase  space  where  P{z^t)dz  is  the  probability  that  the  dynamic 
variable  Z{t)  has  a  value  in  the  interval  {z^  z  -\-  dz)  at  time  t  given  a  value  q{t)  at 
the  origin  2:  =  0.  The  phase  space  dynamics  are  formally  expressed  by  the  equation: 


G 


(44) 


The  analytic  function  G{-^-)  of  the  indicated  operators  along  with  the  inhomoge- 
neous  term  q{t)  determine  the  dynamics  of  the  process  modeled  by  the  pdf. 

A  phase  space  equation  for  a  classical  diffusion  process  with  linear  dissipation 
equivalent  to  Eq.  (42)  is  determined  by: 


a 


d_  d_ 

dt^  dz 


d_ 

dt 


d_ 

dz 


Xz^D 


d_ 

dz 


The  corresponding  FPE  is  given  by^^^: 
dP{z,t\zo,to)  d 


dt 


dz 


Xz 


^  dz 


and    q(t)  =  1 . 


P{z,t\ZQ,tQ)  , 


(45) 


(46) 


where  P{z^t\zQ^tQ)dz  is  the  probability  that  the  dynamic  variable  lies  in  the  phase 
space  interval  {z  +  dz^z)  at  time  t  conditional  on  Z(0)  =  zq  at  time  t  =  to-  The 
solution  to  Eq.  (46)  for  to  =  0  is  given  by: 


P{z,t\zo)  = 


exp 


iz-{z;t))' 


with  average  value  {z\t)  =  zqc       and  variance  cr  (t)  =  D/\{1 


(47) 

-^^).  In  the 

absence  of  dissipation  the  average  value  is  constant  and  the  variance  grows  linearly 
in  time.  However  for  finite  dissipation  the  average  decays  exponentially  in  time 
and  the  variance  approaches  a  constant  value  given  by  the  ratio  of  the  diffusion 
coefficient  and  the  dissipation  rate.  If  Z  is  the  velocity  of  a  Brownian  particle  the 
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variance  would  be  proportional  to  the  temperature  of  the  surrounding  fluid  resulting 
in  the  Einstein  relation  D/X  =  ksT^  see,  for  example,  Fiirth.^^ 

Multiplicative  fluctuations  can  be  addressed  by  examining  the  FPE  associated 
with  a  linear  stochastic  dissipation  parameter. ^^^'^^^  Consider  the  nonlinear  rate 
equation  with  multiplicative  fluctuations 

^  =  -AXlnX  +  e(t)X  (48) 

for  which  the  logarithmic  transformation  Z  =  \nX  yields  the  rate  equation  with 
additive  fluctuations  ^(t)  given  by  Eq.  (42).  The  FPE  for  the  transformed  equation 
is  Eq.  (46)  with  the  solution  Eq.  (47).  In  terms  of  the  original  variable  the  pdf  is  a 
log- normal  distribution,  for  more  details  see  Goel  et  al7^ 

5.1.2.  Stochastic  ontogenetic  growth  model 

A  more  interesting  rate  equation  was  first  considered  by  von  Bertalanffy^^^  who 
postulated  a  simple  nonlinear  rate  equation  to  describe  the  growth  of  TBM  m  of 
the  form 

^  =  A^m^  -  C^m^  (49) 

at  time  t  where  rj  and  /3  are  unspecified  exponents,  and  and  C/^  are  positives 
constants.  The  solution  to  this  equation  was  studied  by  von  Bertalanffy^^^  with 
T]  =  2/3  and  /3  =  1.  More  recently  West  et  al?^^  constructed  Eq.  (49),  called  an 
ontogenetic  growth  model  (OGM),  from  a  conservation  of  energy  argument  and 
obtained  a  universal  curve  from  the  solution  and  fit  it  to  data  by  using  7^  =  3/4 
and  /3  =  1.  In  the  OGM  the  parameters  are  given  by  Cfs  =  Bm/Em  with  Bm  the 
metabolic  rate  required  to  maintain  an  existing  unit  of  biomass;  Em  the  metabolic 
energy  required  to  create  a  unit  biomass  and  =  a/ Em-  Banavar  et  al}^  obtained 
an  equivalent  universal  curve  and  fit  the  same  data  for  both  77  =  2/3,  3/4  and  /3  =  1. 
The  OGM  equation  has  the  scaling  form: 

rjrn 

—  =  m^f{m/Mo) ,  (50) 

where  Mq  is  chosen  to  be  the  maximum  TBM  that  solves  the  stationary  equation 
dm/dt  =  0.  The  data  from  13  organisms  graphed  as  r  =  (m/Mo)^~^  versus  r  = 
—  ln(l  —  r)  collapse  onto  a  single  universal  curve^^^  that  is  fit  equally  well  with 
77  =  2/3  or  77  =  3/4.10 

West  and  West^^^  generalized  the  OGM  to  incorporate  the  disordering  influ- 
ence of  entropy  by  including  the  statistical  fluctuations  of  the  measured  mass.  The 
dynamics  of  such  a  stochastic  process  eventually  erases  the  influence  of  the  initial 
state  and  this  gradual  loss  of  information  is  a  manifestation  of  increasing  entropy. 
They  refer  to  this  as  the  stochastic  ontogenetic  growth  model  (SOGM).  Consider  a 
phenomenological  Langevin  equation  with  multiplicative  fluctuations  for  the  TBM 
where  to  avoid  confusion  we  use  the  dummy  variable  Z  =      which  is  actually  the 
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TBM  for  species  i  and  individual  j  and  individual  j  given  by  Mij.  Thus,  we  replace 
Eq.  (49)  ioi  T]  =  h  and  /3  =  1  for  the  SOGM  stochastic  differential  equation: 

^  =  AbZ"  -  CoZ  +         ,  (51) 

with  £^(t)  given  by  a  delta  correlated  random  process  of  strength  D  with  normal 
statistics  and  correlations  given  by  Eq.  (43).  Rather  than  assuming  the  rate  of 
creating  a  unit  biomass  is  a  single  number  Ci  they  assume  it  has  a  stochastic 
component:  Ci  ^  Co  +  ^(t),  with  average  value  Cq. 

An  equivalent  description  of  the  SOGM  is  given  by  the  corresponding  dynamics 
of  the  pdf  in  phase  space.  The  FPE  corresponding  to  Eq.  (51)  is^^^'^^^: 


dP{z,t)  d_ 
dt  dz 


-{A,z''-Coz)^Dz^z 


P{z,t).  (52) 


The  general  solution  to  Eq.  (52)  remains  a  mystery,  but  the  steady-state  solution 
{t  ^  oo)  denoted  by  the  ss  subscript  obtained  by  setting  the  probability  flux  to 
zero  is  given  by: 

p..w  =  ^i^££t?:3;  (53) 


normalized  on  the  interval  (0,  oo)  with  the  parameter  values  7  =  Ai)//3D^  ji  = 
1  +  {Co/D)  and  /3  =  1  -  6.  Note  that  the  steady-state  solution  to  the  SOGM  FPE 
replaces  the  deterministic  steady-state  solution  obtained  from  the  OGM.^^'^^^ 

West  and  West^^^  fit  the  parameters  in  Eq.  (53)  to  a  data  set  of  mammalian 
species  tabulated  by  Heusner^^  and  this  fit  is  depicted  in  Fig.  11.  They  constructed 
a  histogram  by  partitioning  the  mass  axis  into  intervals  of  20  g  and  counting  the 
number  of  species  within  each  interval.  The  vertical  axis  is  the  relative  number  of 
species  as  a  function  of  TBM.  The  allometry  exponet  is  fixed  at  6  =  3/4  so  that 
/3  =  1/4  and  the  remaining  parameters  are  determined  to  have  the  best  values  of 
7  =  13.4  and  /i  =  2.04  with  a  quality  of  fit  measured  by  the  correlation  coefficient 

=  0.96. 

Figure  11  shows  the  fit  to  the  low-mass  species  out  to  500  g.  The  remaining  fit 
out  to  three  million  grams  is  not  shown  but  is  in  fact  the  more  ineresting  part  of  the 
distribution.  To  capture  this  information  on  a  single  graph  we^^^  construct  a  second 
histogram,  this  one  for  the  asymptotic  region  from  approximately  one  thousand  to 
three  million  grams.  Figure  12  depicts  the  fit  to  the  logarithmic  histogram  data 
points  indicated  by  the  dots  starting  at  a  TBM  of  1.1  kg.  An  inverse  power-law 
distribution  would  be  a  straight  line  with  a  negative  slope  on  this  log-log  graph. 
Inserting  the  parameter  values  /i  =  1.67  and  7  —  8.96  into  the  steady-state  TBM 
pdf  given  by  Eq.  (53)  yields  the  solid  cuve  in  Fig.  12  which  fits  the  data  extremely 
well.  The  curve  is  quite  clearly  an  inverse  power  law  in  the  interspecies  TBM. 
This  coarse-grained  description  of  the  interspecies  TBM  statistics  indicates  great 
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Fig.  11.  The  histogram  constructed  from  the  average  TBM  data  for  391  mammahan  species^^ 
is  given  by  the  dots.  The  mass  data  has  been  divided  into  intervals  of  20  g  each  and  the  number 
of  species  within  each  such  interval  counted.  The  horizontal  axis  is  the  relative  frequency  in  each 
interval  and  the  solid  line  segment  is  the  least  squares  fit  of  Eq.  (53)  to  the  data  points.  The 
quality  of  fit  is      =  0.96.^46 
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Fig.  12.  The  average  TBM  data  for  391  mammalian  species^-*^  are  used  to  construct  a  histogram. 
The  mass  interval  is  divided  into  20  equally  spaced  intervlas  on  a  logarithm  scale  and  the  number 
of  species  within  each  interval  counted.  The  least-square  fit  to  the  nine  data  points  is  then  made 
using  logrimically  transformed  distribution,  see  Ref.  246  for  details.  The  quality  of  the  fit  is 
r2  =  0.998. 

variability  particularly  since  fi  <  2  indicating  that  the  variance  of  the  interspecies 
TBM  diverges. 

5.1.3.  Interspecies  AR  derivation 

^^241, 243  explicitly  constructed  a  statistical  theory  of  the  fluctuations  in  the  mea- 
sured species  metabolic  rate  Bi  and  TBM  Mi  to  calculate  the  interspecies  metabolic 
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AR.  The  strategy  was  to  related  the  average  BMR  to  the  average  TBM  through  the 
AR.  This  approach  can  be  followed  here  from  the  long-time  or  steady-state  average 
SOGM  TBM  using  the  steady-state  pdf: 


f 

Jo 


zPss{z)dz 


exp[— 7Z  ^]dz 


and  replacing  z  with  the  TBM  for  species  i  reduces  the  average  to: 


(3  (3 


7- 


(54) 


Here  we  interpret  the  ensemble  average  in  terms  of  an  average  over  an  ensemble  of 
individual  members  of  species  i.  In  this  equation  we  have  approximated  the  ratio 
of  gamma  functions  by  an  asymptotic  value  and  substituting  the  parameter  values 
into  the  right-hand  side  of  Eq.  (54)  yields: 

/  ^  \  1/^ 

(55) 


The  equality  in  Eq.  (55)  is  obtained  by  substituting  the  values  of  the  parameters 
from  OGM  and  using  6  =  3/4  and  /3  =  1/4  to  yield 

a(Mi)-i/^  (56) 


Br. 


Thus,  the  average  metabolic  rate  necessary  to  maintain  a  unit  biomass  is  dependent 
on  the  inverse  quarter  power  of  the  average  TBM  of  the  species.  This  is  the  same 
expression  obtained  by  Moses  et  al}^^  if  we  identify  their  adult  TBM  with  the 
average  TBM  obtained  from  the  steady-state  pdf  of  the  SOGM. 

Note  that  the  average  TBM  for  species  i  replaces  Mq  obtained  from  the 
dm/dt  =  0  solution  to  Eq.  (50).  On  the  other  hand  the  average  BMR  can  be  deter- 
mined from  the  6th  moment  of  the  TBM 


{B,)=a{M^)=a  /  z'Pss{z)dz 
Jo 


a7 


/3 


1 


7- 


1 


1//3-1 


(57) 


Comparing  this  last  equation  with  Eq.  (54)  allows  us  to  write: 

{B,)  ^  a{M,)^-^  =  a{M,)\  (58) 

which  is  the  interspecies  metabolic  AR.  We  emphasize^^^  that  the  interspecies  AR 
is  a  pheomenological  equation  that  without  the  SOGM  does  not  have  a  formal 
theoretical  basis. 
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The  SOGM  constitutes  the  first  theoretical  construction  of  an  interspecies 
metabohc  AR  starting  from  a  fundamental  dynamic  equation  and  relating  the 
proper  averages.  It  is  true  that  we  used  a  nonlinear  multiplicative  Langevin  equa- 
tion, but  the  original  dynamics  stem  from  the  conservation  of  energy  argument^^^ 
and  the  fluctuations  are  a  consequence  of  the  dynamics  being  degraded  by  en- 
tropy.     This  suggests  a  possible  untraveled  path  for  future  research  in  this  area. 

5.2.  Scaling  solution  for  AR 

A  phase  space  equation  for  anomalous  diffusion  containing  the  influence  of  long- 
term  memory  on  the  dynamics  of  the  pG?/was  flrst  discussed  by  West  and  Seshadri^^ 
in  term  of  the  fractional  calulus.  The  resulting  fractional  diffusion  equation^^^'^^^ 
(FDE)  has  subsequently  found  a  variety  of  uses  in  the  physical  sciences.  The  pdf 
that  solves  a  class  of  FDEs  satisfles  the  scaling  equation 


Note  that  this  scaling  of  the  pdf  is  also  a  realization  of  the  RG  relation  for  the 
random  variable  Z{bt)  =  b^Z{t). 

The  function  Fz{-)  in  Eq.  (59)  is  left  unspecified  but  it  is  analytic  in  the  similarity 
variable  z/t^.  As  mentioned  in  the  introduction  a  standard  diffusion  process  Z{t) 
is  the  displacement  of  the  diffusing  particle  from  its  initial  position  at  time  and 
for  vanishing  small  dissipation  the  scaling  parameter  is  /i;^  =  1  /2  and  the  functional 
form  of  Fz{-)  is  a  normal  distribution.  However,  for  general  complex  phenomenon 
there  is  a  broad  class  of  distributions  for  which  the  functional  form  of  Fz{-)  is 
not  Gaussian  and  the  scaling  index  fiz  ^  1/2.  For  example,  the  a-stable  Levy 
process"*^^^'"*^^^'^^^'^^^  scales  in  this  way  and  the  Levy  index  is  in  the  range  0  <  a  < 
2,  with  the  equality  holding  for  the  Gauss  distribution  and  the  scaling  index  is 
related  to  the  Levy  index  by  fiz  =  l/<^,  see  West,  Geneston  and  Grigolini^^^  and 
Uchaikin^-*^^  for  very  different  discussions  of  this  scaling. 

The  Gibbs  entropy  (Shannon  information), ^^'^^^  can  be  defined  using  the  pdf 
for  the  variable  Z{t)  Eq.  (59)  to  obtain: 


so  the  entropy  deviation  from  the  reference  state  =  —  J Fz{q)  In  Fz{q)dq  inte- 
grated with  respect  to  the  scaled  variable  q  =  z/t^^^  increases  logarithmically  in 
the  independent  variable  t. 

Huxley^^  assumed  the  differential  growth  equations  in  an  organism  to  have 
the  same  form  with  proportional  growth  rates.  West  and  West^^^'^^^  adapted  this 
assumption  and  presumed  the  two  parts  of  an  organism  share  the  same  class  of 
probability  densities  that  describe  their  interacting  observables.  Here  t  is  clock  time, 
the  independent  variable,  and  the  network  measures  are  the  dependent  variables. 


(59) 


(60) 
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the  average  of  a  generic  scaling  observable  using  the  pdf  given  by  Eq.  (59)  yields: 

{Z{t))=  J  zP{z,t)dz  =  t^^  J  qF,{q)dq  =  t^^ Z  .  (61) 

Note  that  Z  =  JqFz{q)dq  is  a  finite,  time-independent  constant.  Comparing 

Eqs.  (61)  and  (60)  enables  us  to  write  for  the  change  in  entropy  relative  to  a 
reference  state: 

AS,  =  \n[{Z{t))/Z]  (62) 

SO  that  the  change  in  the  average  value  of  an  observable  is  determined  by  the  change 
in  entropy 

{Z{t))  =  Ze^^^(^)  .  (63) 

Consequently  Z  may  be  identified  with  either  of  the  network  variables  to  obtain  the 
relation  between  entropies  from  Eq.  (60) 

and  using  Eq.  (63)  we  can  write: 

{¥,)  =  Ye^^y  =  Ye^^^^  =  a{X,)^  (64) 


with  the  allometry  coefficient  given  by  a  =  Y /X^  and  the  allometry  exponent  by 
b  =  fly/ fix-  We  have  again  introduced  the  index  for  the  single  species  i  to  emphsize 
that  this  is  the  interspecies  AR.  This  derivation  of  the  empirical  interspecies  AR  is 
solely  a  consequence  of  the  scaling  properties  of  the  pdf^s. 

The  rate  of  entropy  generation  is  determined  by  the  time  derivative  of  Eq.  (63) 
for  X  and  Y  and  substituting  from  the  entropy  balance  equation  yields: 

1     d{X,)  _      1     d{Y,)  ^^^^ 


IJix{Xi)    dt        /^yiYi)  dt 

Equation  (65)  shows  the  correspondence  of  the  probabilistic  approach  to  AR  with 
that  of  Huxley,  with  the  dynamic  variables  of  individual  species  members  replaced 
by  averages  over  an  ensemble  of  individuals.  In  the  present  case  the  allometry  ex- 
ponent is  now  the  ratio  of  the  power-law  indices  in  the  pdf^s,  which  from  Eq.  (65) 
can  also  be  interpreted  as  the  ratio  of  growth  rates.  Moreover,  the  allometry  coeffi- 
cient is  determined  by  the  scaling  functions  in  the  pdf^s.  It  is  the  average  response 
or  adaptation  of  the  subnetwork  to  the  average  behavior  of  the  host  network  that 
is  captured  by  the  interspecies  AR  through  the  balance  of  the  entropy  generated. 
Moreover  it  is  the  scaling  in  the  pdf  and  not  necessarily  geometric  scaling  of  fractal 
networks  that  is  the  origin  of  the  physiologic  AR.  Underlying  this  derivation  is  the 
assumption  made  earlier  that  this  is  an  information-dominated  network  and  is  con- 
sequently driven  by  information  (entropy)  gradients  and  not  the  energy  gradient 
necessary  in  the  fractal  geometry  derivation  of  nutrient  transport. 
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6.  Conclusion 

The  statistical  analysis  of  the  metabolic  data  presented  herein  show  that  empir- 
ical ARs  exist  across  multiple  species  and  consequently  the  form  of  ARs  are  not 
solely  dependent  on  species  specific  mechanisms.  Moreover  we  have  shown  there  is 
no  universal  value  for  the  allometry  exponent  for  three  reasons.  First  there  is  the 
variability  in  the  values  of  the  allometry  exponent  obtained  from  data  analysis  in 
Sec.  3.  We  could  rehash  the  arguments  here,  but  an  objective  observer  would  have 
to  conclude  that  although  there  are  indications  that  a  particular  value  might  be 
strongly  indicated  in  a  specific  context,  the  evidence  for  universality  is  not  com- 
pelling. Even  in  the  case  of  metabolic  ARs  one  could  reasonably  make  a  case  of 
b  =  2/3  for  small  animals,  b  =  3/4  for  large  animal  and  6  =  1  for  plants,  but  no 
one  value  of  b  that  spans  the  total  range  of  animal  and  plant  sizes.  Second,  the 
only  theories  that  predict  a  universal  value  of  the  allometry  exponent  do  so  to  ex- 
plain the  intraspecies  AR  and  not  the  interspecies  AR.  There  is  no  first  principles 
theory  that  derives  the  empirical  AR  between  the  averaged  observables.  The  phe- 
nomenological  theory  presented  in  Section  4  successfully  derives  the  interspecies 
AR  and  treats  the  allometry  coefficient  and  allometry  exponent  as  empirical  pa- 
rameters determined  by  the  parameters  in  the  pdfs.  Third  and  last  the  allometry 
exponent  and  coefficient  are  determined  to  co-vary  using  theory^^^  and  statistical 
data  analysis. 

However  let  us  not  be  too  critical  of  phenomenological  theories.  Recall  that 
thermodynamics  is  a  phenomenological  theory  that  has  enjoyed  remarkable  suc- 
cess in  explaining  a  plethora  of  complex  physical  phenomena.  Even  so  the  physics 
community  has  not  yet  been  able  to  derive  thermodynamics  from  the  more  reduc- 
tionist (fundamental)  statistical  mechanics,  so  that  a  mechanistic  understanding 
of  thermodynamics  remains  controversial.  By  the  same  token  the  phenomenolog- 
ical theory  of  AR  presented  herein  takes  the  discussion  of  AR  out  of  the  domain 
of  the  reductionist  approaches  previously  used  to  derive  the  intraspecies  ARs  and 
refocuses  it  on  the  statistical  properties  of  the  empirical  interspecies  ARs.  These 
analyses  indicate  a  new  avenue  for  the  study  of  physiologic  ARs,  one  that  system- 
atically includes  both  deterministic  and  statistical  aspects  using  the  probability 
calculus.  This  new  perspective  indicates  that  the  origins  of  physiologic  AR  reside 
in  the  scaling  properties  of  the  pdf^s  for  nested  complex  networks. 

In  simple  physical  phenomena  the  dynamics  of  a  network  coupled  to  the  en- 
vironment is  described  by  a  Langevin  equation,  which  is  a  stochastic  differential 
equation.  This  mechanical  description  of  the  dynamics  is  equivalent  to  the  phase 
space  dynamics  of  the  pdf  such,  as  given  by  the  FPE.  As  the  network  becomes  more 
complex  its  past  history  becomes  more  and  more  inffuential  until  finally  the  FPE 
must  be  replaced  with  a  FDE  and  the  dynamics  are  the  domain  of  the  fractional 
calculus.  As  we  saw  in  Sec.  5  the  general  solution  to  a  class  of  FDE's  is  a  scaling  pdf 
implying  that  ARs  can  result  from  scaling  in  the  statistical  dynamics  of  the  phe- 
nomenon. The  infiuence  of  the  history  of  the  growth  of  an  organism  on  the  growth 
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of  an  organ  leads  to  intraspecies  AR;  the  history  of  the  size  of  an  organism  on  the 
metabohc  rate  of  the  organism  across  species  leads  to  interspecies  AR. 

The  arguments  in  Sec.  5  imply  that  empirical  ARs  are  a  consequence  of  infor- 
mation transfer  between  complex  information-dominated  networks.  Of  course  we 
have  not  rigorously  proven  that  the  entropy  decrease  (information  increase)  in  the 
host  network  and  the  increase  in  entropy  (information  decrease)  in  the  subnetwork 
have  the  relations  assumed,  this  remains  to  be  done.  However  the  empirical  rela- 
tions are  consistent  with  the  Principle  of  Complexity  Management  in  which  the 
transfer  of  information  between  two  complex  information-dominated  networks  has 
been  shown^'^^^  to  proceed  from  the  network  with  the  greater  information  content 
to  the  network  with  the  lesser. 

The  phenomenological  theory  of  empirical  AR  presented  herein  incorporates 
both  stochastic  and  reductionistic  mechanisms.  From  this  approach  we  conclude 
that  the  empirical  AR  for  the  BMR,  the  life  time  of  an  organism  and  the  myriad 
of  other  complex  phenomena  are  not  completely  explained  by  reductionistic  mech- 
anisms. In  particular  the  allometry  parameters  are  determined  not  to  be  universal. 
This  was  supported  by  the  demonstration  that  the  allometry  exponent  and  coeffi- 
cient co-vary.  On  the  other  hand,  the  laws  from  the  theory  of  probability  can  have 
universal  forms  without  having  universal  parameters,  for  example  the  Law  of  Fre- 
quency of  Errors  and  the  Law  of  Large  Numbers.  The  empirical  AR  is  in  part  a 
consequence  of  the  generic  statistical  behavior  of  complex  networks  that  depend  on 
the  infinitely  divisible  nature  of  the  pdfJ^'^^  The  phenomenological  theory  of  AR 
presented  herein  is  the  application  of  the  probability  calculus  to  the  understanding 
of  the  origins  of  the  empirical  relations  between  averages  of  different  parts  of  a  com- 
plex network.  Consequently  empirical  AR  is  a  manifestation  of  a  law  whose  origin 
can  be  traced  back  to  the  probability  calculus  and  the  balancing  of  deterministic 
and  stochastic  mechanisms. 
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Abstract— T\ve^  problem  of  distributed  learning  and  channel  ac- 
cess is  considered  in  a  cognitive  network  with  multiple  secondary 
users.  The  availability  statistics  of  the  channels  are  initially 
unknown  to  the  secondary  users  and  are  estimated  using  sensing 
decisions.  There  is  no  explicit  information  exchange  or  prior 
agreement  among  the  secondary  users  and  sensing  and  access 
decisions  are  undertaken  by  them  in  a  completely  distributed 
manner.  We  propose  policies  for  distributed  learning  and  ac- 
cess which  achieve  order-optimal  cognitive  system  throughput 
(number  of  successful  secondary  transmissions)  under  self  play, 
i.e.,  when  implemented  at  all  the  secondary  users.  Equivalently, 
our  policies  minimize  the  sum  regret  in  distributed  learning  and 
access,  which  is  the  loss  in  secondary  throughput  due  to  learning 
and  distributed  access.  For  the  scenario  when  the  number  of 
secondary  users  is  known  to  the  policy,  we  prove  that  the  total 
regret  is  logarithmic  in  the  number  of  transmission  slots.  This 
policy  achieves  order-optimal  regret  based  on  a  logarithmic  lower 
bound  for  regret  under  any  uniformly-good  learning  and  access 
policy.  We  then  consider  the  case  when  the  number  of  secondary 
users  is  fixed  but  unknown,  and  is  estimated  at  each  user  through 
feedback.  We  propose  a  policy  whose  sum  regret  grows  only 
slightly  faster  than  logarithmic  in  the  number  of  transmission 
slots. 

Index  7(p™5— Cognitive  medium  access  control,  multi-armed 
bandits,  distributed  algorithms,  logarithmic  regret. 


1.  Introduction 

THERE  has  been  extensive  research  on  cognitive  radio 
network  in  the  past  decade  to  resolve  many  challenges  not 
encountered  previously  in  traditional  communication  networks 
(see  e.g.,  [2]).  One  of  the  main  challenges  is  to  achieve 
coexistence  of  heterogeneous  users  accessing  the  same  part  of 
the  spectrum.  In  a  hierarchical  cognitive  network,  there  are  two 
classes  of  transmitting  users,  viz.,  the  primary  users  who  have 
priority  in  accessing  the  spectrum  and  the  secondary  users  who 
opportunistically  transmit  when  the  primary  user  is  idle.  The 
secondary  users  are  cognitive  and  can  sense  the  spectrum  to 
detect  the  presence  of  a  primary  transmission.  However,  due 
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to  resource  and  hardware  constraints,  they  can  sense  only  a 
part  of  the  spectrum  at  any  given  time. 

We  consider  a  slotted  cognitive  system  where  each  sec- 
ondary user  can  sense  and  access  only  one  orthogonal  channel 
in  each  transmission  slot  (see  Fig.  1).  Under  sensing  con- 
straints, it  is  thus  beneficial  for  the  secondary  users  to  select 
channels  with  higher  mean  availability,  i.e.,  channels  which 
are  less  likely  to  be  occupied  by  the  primary  users.  However, 
in  practice,  the  channel  availability  statistics  are  unknown  to 
the  secondary  users  at  the  start  of  the  transmissions. 

Since  the  secondary  users  are  required  to  sense  the  medium 
before  transmission,  can  these  sensing  decisions  be  used  to 
learn  the  channel  availability  statistics?  If  so,  using  these 
estimated  channel  availabilities,  can  we  design  channel  access 
rules  which  maximize  the  transmission  throughput?  Designing 
provably  efficient  algorithms  to  accomplish  the  above  goals 
forms  the  focus  of  our  paper.  Such  algorithms  need  to  be 
efficient,  both  in  terms  of  learning  and  channel  access. 

For  any  learning  algorithm,  there  are  two  important  perfor- 
mance criteria:  consistency  and  regret  bounds  [3].  A  learning 
algorithm  is  said  to  be  consistent  if  the  learnt  estimates 
converge  to  the  true  values  as  the  number  of  samples  goes 
to  infinity.  The  regret^  of  a  learning  algorithm  is  a  measure 
of  the  speed  of  convergence,  and  is  thus  meaningful  only  for 
consistent  algorithms.  In  our  context,  the  users  need  to  learn 
the  channel  availability  statistics  consistently  in  a  distributed 
manner.  The  regret  in  this  case  is  defined  as  the  loss  in 
secondary  throughput  due  to  learning  when  compared  to  the 
ideal  scenario  where  the  channel  statistics  are  known  perfectly 
(see  (2)).  It  is  thus  desirable  to  design  distributed  learning 
algorithms  with  small  regret. 

Additionally,  we  consider  a  distributed  framework  where 
there  is  no  information  exchange  or  prior  agreement  among 
the  secondary  users.  This  introduces  additional  challenges: 
it  results  in  loss  of  throughput  due  to  collisions  among  the 
secondary  users,  and  there  is  now  competition  among  the 
secondary  users  since  they  all  tend  to  access  channels  with 
higher  availabilities.  It  is  imperative  that  the  channel  access 
policies  overcome  the  above  challenges.  Hence,  a  distributed 
learning  and  access  policy  experiences  regret  both  due  to 
learning  the  unknown  channel  availabilities  as  well  as  due 
to  collisions  under  distributed  access. 

^Note  that  the  sum  regret  is  a  finer  measure  of  performance  of  a 
distributed  algorithm  than  the  time-averaged  total  throughput  of  the  users 
since  any  sub-linear  regret  (with  respect  to  time)  implies  optimal  average 
throughput.  On  the  other  hand,  regret  is  equivalent  to  total  throughput  and 
hence,  optimal  regret  is  equivalent  to  achieving  optimal  total  throughput. 
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Fig.  1.  Cognitive  radio  network  with  U  =  4  secondary  users  and  C  =  5 
channels.  A  secondary  user  is  not  allowed  to  transmit  if  the  accessed  channel 
is  occupied  by  a  primary  user  If  more  than  one  secondary  user  transmits  in 
the  same  free  channel,  then  all  the  transmissions  are  unsuccessful. 


A.  Our  Contributions 

The  main  contributions  of  this  paper  are  two  fold.  First, 
we  propose  two  distributed  learning  and  access  policies  under 
multiple  secondary  users  in  a  cognitive  network.  Second,  we 
provide  performance  guarantees  for  these  policies  in  terms  of 
regret.  Overall,  we  prove  that  one  of  our  proposed  algorithms 
achieves  order-optimal  regret  and  the  other  achieves  nearly 
order-optimal  regret,  where  the  order  is  with  respect  to  the 
number  of  transmission  slots. 

The  first  policy  we  propose  assumes  that  the  total  number 
of  secondary  users  in  the  system  is  known  while  our  second 
policy  removes  this  requirement.  We  provide  bounds  on  the 
total  regret  experienced  by  the  secondary  users  under  self 
play,  i.e.,  when  implemented  at  all  the  secondary  users.  For 
the  first  policy,  we  prove  that  the  regret  is  logarithmic,  i.e., 
0(log  n)  where  n  in  the  number  of  transmission  slots.  For  the 
second  policy,  the  regret  grows  slightly  faster  than  logarithmic, 
i.e.,  0(/(n)  log  n),  where  we  can  choose  any  function  /(n) 
satisfying  /(n)  oo,  as  n  oo.  Hence,  we  provide 
performance  guarantees  for  the  proposed  distributed  learning 
and  access  policies. 

A  lower  bound  on  regret  under  any  uniformly- good  dis- 
tributed learning  policy  has  been  derived  in  [4],  which  is  also 
logarithmic  in  the  number  of  transmission  slots  (See  (6)  for 
definition  of  uniformly-good  policies).  Thus,  our  first  policy 
(which  requires  knowledge  of  the  number  of  secondary  users) 
achieves  order-optimal  regret.  The  effects  of  the  number  of 
secondary  users  and  the  number  of  channels  on  regret  are 
also  explicitly  characterized  and  verified  via  simulations. 

The  exploration- exploitation  tradeoff  for  learning,  com- 
bined with  the  cooperation-competition  tradeoffs  among  mul- 
tiple users  for  distributed  medium  access  have  not  been 
sufficiently  examined  in  the  literature  (see  Section  I-B  for  a 
discussion).  Our  analysis  in  this  paper  provides  important  en- 
gineering insights  towards  dealing  with  learning,  competition, 
and  cooperation  in  practical  cognitive  systems. 


Remark  on  Model  Assumptions:  We  note  some  of  the 
limitations  of  our  approach.  We  prove  order-optimal  regret  for 
our  proposed  algorithms  under  an  i.i.d.  primary  transmission 
model^,  where  the  probability  of  a  channel  being  occupied 
by  any  primary  user  in  any  slot  is  assumed  to  be  i.i.d. 
This  is  indeed  is  idealistic  and  in  practice,  a  Markovian 
model  may  be  more  appropriate  [5],  [6].  The  i.i.d.  model 
leads  to  simple  learning  schemes  for  channel  availabilities 
but  is  not  crucial  towards  deriving  regret  bounds  for  our 
proposed  schemes.  Extensions  of  the  classical  multi-armed 
bandit  problem  to  a  Markovian  model  are  considered  in  [7]. 
In  principle,  our  results  on  achieving  order-optimal  regret  for 
distributed  learning  and  access  can  be  similarly  extended  to 
a  Markovian  channel  model  but  this  entails  more  complex 
estimators  and  rules  for  evaluating  the  exploration- exploitation 
tradeoffs  of  different  channels  and  is  a  topic  of  interest  for 
future  investigation. 

B.  Related  Work 

Several  results  on  the  multi-armed  bandit  problem  will  be 
used  and  generalized  to  study  our  problem.  Detailed  discussion 
on  multi-armed  bandits  can  be  found  in  [8]-[ll].  Cognitive 
medium  access  is  a  topic  of  extensive  research;  see  [12] 
for  an  overview.  The  connection  between  cognitive  medium 
access  and  the  multi-armed  bandit  problem  is  explored  in 
[13],  where  a  restless  bandit  formulation  is  employed.  Under 
this  formulation,  indexability  is  established.  Whittle's  index 
for  channel  selection  is  obtained  in  closed-form,  and  the 
equivalence  between  the  myopic  policy  and  the  Whittle's 
index  is  established.  However,  this  work  assumes  known 
channel  availability  statistics  and  does  not  consider  competing 
secondary  users.  The  work  in  [14]  considers  allocation  of  two 
users  to  two  channels  under  a  Markovian  channel  model  using 
a  partially  observable  Markov  decision  process  (POMDP) 
framework.  The  use  of  collision  feedback  information  for 
learning,  and  spatial  heterogeneity  in  spectrum  opportunities 
were  investigated.  However,  the  difference  from  our  work 
is  that  [14]  assumes  that  the  availability  statistics  (transition 
probabilities)  of  the  channels  are  known  to  the  secondary  users 
while  we  consider  learning  of  unknown  channel  statistics.  The 
works  in  [15],  [16]  consider  centralized  access  schemes  in 
contrast  to  distributed  access  here,  the  work  in  [17]  considers 
access  through  information  exchange  and  studies  the  optimal 
choice  of  the  amount  of  information  to  be  exchanged  given 
the  cost  of  negotiation.  The  work  in  [18]  considers  access 
under  Q -learning  for  two  users  and  two  channels  where  users 
can  sense  both  the  channels  simultaneously.  The  work  in 
[19]  discusses  a  game-theoretic  approach  to  cognitive  medium 
access.  In  [20],  learning  in  congestion  games  through  multi- 
plicative updates  is  considered  and  convergence  to  weakly- 
stable  equilibria  (which  reduces  to  the  pure  Nash  equilibrium 
for  almost  all  games)  is  proven.  However,  the  work  assumes 
fixed  costs  (or  equivalently  rewards)  in  contrast  to  random 
rewards  here,  and  that  the  players  can  fully  observe  the  actions 
of  other  players. 

^By  i.i.d.  primary  transmission  model,  we  do  not  mean  the  presence  of 
a  single  primary  user,  but  rather,  this  model  is  used  to  capture  the  overall 
statistical  behavior  of  all  the  primary  users  in  the  system. 
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Recently,  the  work  in  [21]  considers  combinatorial  bandits, 
where  a  more  general  model  of  different  (unknown)  channel 
availabilities  is  assumed  for  different  secondary  users,  and  a 
matching  algorithm  is  proposed  for  jointly  allocating  users 
to  channels.  The  algorithm  is  guaranteed  to  have  logarithmic 
regret  with  respect  to  the  number  of  transmission  slots  and 
polynomial  storage  requirements.  A  decentralized  implemen- 
tation of  the  proposed  algorithm  is  proposed  but  it  still 
requires  information  exchange  and  coordination  among  the 
users.  In  contrast,  we  propose  algorithms  which  removes  this 
requirement  albeit  in  a  more  restrictive  setting  of  identical 
channel  availabilities  for  all  users. 

Recently,  Liu  and  Zhao  [4]  proposed  a  family  of  distributed 
learning  and  access  policies  known  as  time-division  fair  share 
(TDFS),  and  proved  logarithmic  regret  for  these  policies.  They 
established  a  lower  bound  on  the  growth  rate  of  system  regret 
for  a  general  class  of  uniformly-good  decentralized  polices. 
The  TDFS  policies  in  [4]  can  incorporate  any  order-optimal 
single-player  policy  while  our  work  here  is  based  on  the 
single-user  policy  proposed  in  [11].  Another  difference  is 
that  in  [4],  the  users  orthogonalize  via  settling  at  different 
offsets  in  their  time-sharing  schedule,  while  in  our  work  here, 
users  orthogonalize  into  different  channels.  Moreover,  the 
TDFS  policies  ensure  that  each  player  achieves  the  same  time- 
average  reward  while  our  policies  here  achieve  probabilistic 
fairness,  in  the  sense  that  the  policies  do  not  discriminate  be- 
tween different  users.  In  [22],  the  TDFS  policies  are  extended 
to  incorporate  imperfect  sensing. 

In  our  prior  work  [1],  we  first  formulated  the  problem 
of  decentralized  learning  and  access  for  multiple  secondary 
users.  We  considered  two  scenarios:  one  where  there  is  initial 
common  information  among  the  secondary  users  in  the  form  of 
pre-allocated  ranks,  and  the  other  where  no  such  information 
is  available.  In  this  paper,  we  analyze  the  distributed  policy  in 
detail  and  prove  that  it  has  logarithmic  regret.  In  addition,  we 
also  consider  the  case  when  the  number  of  secondary  users  is 
unknown,  and  provide  bounds  on  regret  in  this  scenario. 

Organization:  Section  II  deals  with  the  system  model. 
Section  III  deals  with  the  special  case  of  single  secondary 
user  and  of  multiple  users  with  centralized  access  which 
can  be  directly  solved  using  the  classical  results  on  multi- 
armed  bandits.  In  Section  IV,  we  propose  distributed  learning 
and  access  policy  with  provably  logarithmic  regret  when  the 
number  of  secondary  users  is  known.  Section  V  considers  the 
scenario  when  the  number  of  secondary  users  is  unknown. 
Section  VI  provides  a  lower  bound  for  distributed  learning. 
Section  VII  includes  some  simulation  results  for  the  proposed 
schemes  and  Section  VIII  concludes  the  paper.  Most  of  the 
proofs  are  found  in  the  Appendix. 

Since  Section  III  mostly  deals  with  a  recap  of  the  clas- 
sical results  on  multi-armed  bandits,  those  familiar  with  the 
literature  (e.g.,  [8]-[ll])  may  skip  this  section  without  loss  of 
continuity. 

II.  System  Model  &  Formulation 

Notation:  For  any  two  functions  f{n)^g{n),  f{n)  = 
0{g{n))  if  there  exists  a  constant  c  such  that  /(n)  <  cg{n) 
for  all  n  >  no  for  a  fixed  no  G  N.  Similarly,  f{n)=Vt{g{n)) 
if  there  exists  a  constant  d  such  that  /(n)  >  d g{n)  for  all 


n  >  no  for  a  fixed  no  G  N,  and  /(n)  =  Q{g{n))  if  /(n)  = 
ft{g{n))  and  /(n)  =  0{g{n)).  Also,  /(n)  =  o{g{n))  when 
f{n)/g{n)  0  and  /(n)  =  u;{g{n))  when  f{n)/g{n)  oo 
as  n  ^  oo. 

For  a  vector  fi,  let  |/L6|  denote  its  cardinality,  and  let 
fj.  =  [/ii, /i2,  •  •  • ,  M|/L6|]^.  We  refer  to  the  U  highest  entries 
in  a  vector  as  the  [/-best  channels  and  the  rest  as  the  U- 
worst  channels,  where  I  <  U  <  Let  a{a;iJ,)  denote 
the  index  of  the  a*  highest  entry  in  fi.  Alternatively,  we 
abbreviate  a*:=a{a;  fi)  for  ease  of  notation.  Let  B{jj.)  denote 
the  Bernoulli  distribution  with  mean  /i.  With  abuse  of  notation, 
let  D{iii,ii2)-=D{B{iii)]B{ii2))  be  the  Kullback-Leibler 
distance  between  the  Bernoulli  distributions  and  B{ii2) 

[23]  and  let  A(l,  2):=/ii  -  /i2. 

A.  Sensing  &  Channel  Models 

Let  >  1  be  the  number  of  secondary  users^  and  C  >  U 
be  the  number"^  of  orthogonal  channels  available  for  slotted 
transmissions  with  a  fixed  slot  width.  In  each  channel  i  and 
slot  /c,  the  primary  user  transmits  i.i.d.  with  probability  1  — 
/ii  >  0.  In  other  words,  let  Wi{k)  denote  the  indicator  variable 
if  the  channel  is  free 

f  0,       channel  i  occupied  in  slot  k 

I  1,  o.w, 

and  we  assume  that  Wi{k)  B{iii). 

The  mean  availability  vector  ii  consists  of  mean  availabil- 
ities jii  of  all  channels,  i.e.,  is  /L6:=[/ii, . . . , /ic],  where  all 
/ii  G  (0, 1)  and  are  distinct,  ii  is  initially  unknown  to  all  the 
secondary  users  and  is  learnt  independently  over  time  using 
the  past  sensing  decisions  without  any  information  exchange 
among  the  users.  We  assume  that  channel  sensing  is  perfect 
at  all  the  users. 

Let  Tij{k)  denote  the  number  of  slots  where  channel  i  is 
sensed  in  k  slots  by  user  j  (not  necessarily  being  the  sole 
user  to  sense  that  channel).  Note  that  Ylf^i  ^^iji^)  —  ^ 
all  users  j,  since  each  user  senses  exactly  one  channel  in  every 
time  slot.  The  sensing  variables  are  obtained  as  follows:  at  the 
beginning  of  the  /c*  slot,  each  secondary  user  j  G  /7  selects 
exactly  one  channel  z  G  C  for  sensing,  and  hence,  obtains  the 
value  of  Wi{k),  indicating  if  the  channel  is  free.  User  j  then 
records  all  the  sensing  decisions  of  each  channel  i  in  a  vector 
Xl^:=[Xijil),...,XijiT,j{k))f.  Hence,  ug.l^lj  is  the 
collection  of  sensed  decisions  for  user  j  in  k  slots  for  all  the 
C  channels. 

We  assume  the  collision  model  under  which  if  two  or 
more  users  transmit  in  the  same  channel  then  none  of  the 
transmissions  go  through.  At  the  end  of  each  slot  /c,  each 
user  j  receives  acknowledgement  Zj{k)  on  whether  its  trans- 
mission in  the  /c*  slot  was  received.  Hence,  in  general,  any 
policy  employed  by  user  j  in  the  (/c  +  1)'^  slot,  denoted  by 
p(U^xX^^  ,  Z^),  is  based  on  all  the  previous  sensing  and 
feedback  results. 

■^A  user  refers  to  a  secondary  user  unless  otherwise  mentioned. 

^When  U  >  C,  learning  availability  statistics  is  less  crucial,  since  all 
channels  need  to  be  accessed  to  avoid  collisions.  In  this  case,  design  of 
medium  access  is  more  crucial. 
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Algorithm  1  Single  User  Policy  p^(g(n))  in  [10]. 

Input:  {Xi{n)}i=i^,,,^c  •  Sample-mean  availabilities  after  n 

rounds,  g{i;n):  statistic  based  on  Xij{n), 

cr(a;  g(n)):  index  of  a*  highest  entry  in  g(n). 

Init:  Sense  in  each  channel  once,  n  ^  C,  Curr_Sel  ^  C. 

Loop:  n  ^  n  +  1 

Curr_Sel  ^  channel  corresponding  to  highest  entry  in  g(n) 
for  sensing.  If  free,  transmit. 


B.  Regret  of  a  Policy 

Under  the  above  model,  we  are  interested  in  designing 
policies  p  which  maximize  the  expected  number  of  successful 
transmissions  of  the  secondary  users  subject  to  the  non- 
interference constraint  for  the  primary  users.  Let  S{n;  fi^U^  p) 
be  the  expected  total  number  of  successful  transmissions  after 
n  slots  under  U  number  of  secondary  users  and  policy  p. 

In  the  ideal  scenario  where  the  availability  statistics  /jL  are 
known  a  priori  and  a  central  agent  orthogonally  allocates  the 
secondary  users  to  the  U -best  channels,  the  expected  number 
of  successful  transmissions  after  n  slots  is  given  by 


(1) 


where  j*  is  the  jf*-highest  entry  in  /j,. 

It  is  clear  that  S*{n;  /l^,  U)  >  S{n;  /la,  [/,  p)  for  any  policy 
p  and  any  finite  n  G  N.  We  are  interested  in  minimizing  the 
regret  in  learning  and  access,  given  by 

i?(n;  /LA,  U,  p):=S*{n',  /la,  U)  -  S{n]  /jL,  U,  p)  >  0.  (2) 

We  are  interested  in  minimizing  regret  under  any  given  /la  G 
(0, 1)^  with  distinct  elements. 

By  incorporating  the  collision  channel  model  assumption 
with  no  avoidance  mechanisms^,  the  expected  throughput 
under  the  policy  p  is  given  by 

c  u 

s{n;  f^.u,p)  =  Y,Yl  f^mv^An)]. 

i=i  j=i 

where  Vij{n)  is  the  number  of  times  in  n  slots  where  user 
j  is  the  sole  user  to  sense  channel  i.  Hence,  the  regret  in  (2) 
simplifies  as 

u  c  u 

R{n;p)  =  ^nM(r)  -  ^  ^  M(*)E[yij(n)].  (3) 

k=l  i=l  j=l 

III.  Special  Cases  From  Known  Results 

We  recap  the  bounds  for  the  regret  under  the  special  cases 
of  a  single  secondary  user  {U  =  1)  and  multiple  users  with 
centralized  learning  and  access  by  appealing  to  the  classical 
results  on  the  multi-armed  bandit  process  [8]-[10]. 

A.  Single  Secondary  User  ([/  =  !) 

When  there  is  only  one  secondary  user,  the  problem  of 
finding  policies  with  minimum  regret  reduces  to  that  of  a 

^If  the  users  employ  CSMA-CA  to  avoid  collisions  then  the  regret  is 
reduced.  The  bounds  derived  in  this  paper  are  applicable  for  this  case  as 
well. 


multi-armed  bandit  process.  Lai  and  Robbins  [8]  first  analyzed 
schemes  for  multi-armed  bandits  with  asymptotic  logarithmic 
regret  based  on  the  upper  confidence  bounds  on  the  unknown 
channel  availabilities.  Since  then,  simpler  schemes  have  been 
proposed  in  [10],  [1 1]  which  compute  a  statistic  or  an  index  for 
each  arm  (channel),  henceforth  referred  to  as  the  g-statistic, 
based  only  on  its  sample  mean  and  the  number  of  slots  where 
the  particular  arm  is  sensed.  The  arm  with  the  highest  index  is 
selected  in  each  slot  in  these  works.  We  summarize  the  policy 
in  Algorithm  1  and  denote  it  p^(g(n)),  where  g(n)  is  the 
vector  of  scores  assigned  to  the  channels  after  n  transmission 
slots. 

The  sample-mean  based  policy  in  [11,  Thm.  1]  proposes  an 
index  for  each  channel  i  and  user  j  at  time  n  is  given  by 


MEAN 


{i;n):=Xi,,{T,j{n)) 


2  logn 


(4) 


where  Tij{n)  is  the  number  of  slots  where  user  j  selects 
channel  i  for  sensing. 


X,,,(T,,,(n)):=  ^ 


k=l 


is  the  sample-mean  availability  of  channel  i,  as  sensed  by  user 
j- 

The  statistic  in  (4)  captures  the  exploration-exploitation 
tradeoff  between  sensing  the  channel  with  the  best  predicted 
availability  to  maximize  immediate  throughput  and  sensing 
different  channels  to  obtain  improved  estimates  of  their  avail- 
abilities. The  sample-mean  term  in  (4)  corresponds  to  exploita- 
tion while  the  other  term  involving  Tij{n)  corresponds  to 
exploration  since  it  penalizes  channels  which  are  not  sensed 
often.  The  normalization  of  the  exploration  term  with  logn 
in  (4)  implies  that  the  term  is  significant  when  Tij{n)  is 
much  smaller  than  log  n.  On  the  other  hand,  if  all  the  channels 
are  sensed  6 (logn)  number  of  times,  the  exploration  terms 
become  unimportant  in  the  ^-statistics  of  the  channels  and  the 
exploitation  term  dominates,  thereby,  favoring  sensing  of  the 
channel  with  the  highest  sample  mean. 

The  regret  based  on  the  above  statistic  in  (4)  is  logarithmic 
for  any  finite  number  of  slots  n  but  does  not  have  the  optimal 
scaling  constant.  The  sample-mean  based  statistic  in  [10, 
Example  5.7]  leads  to  the  optimal  scaling  constant  for  regret 
and  is  given  by 


^f^(z;n):=X,,,(T,,,(n))+min 


/  logn 
2T.,,(n) 


,1 


(5) 


In  this  paper,  we  design  policies  based  on  the  g^^^^  statistic 
since  it  is  simpler  to  analyze  than  the  g^^^  statistic.  This  is 
because  g^^^^  is  a  continuous  function  in  Tij{n)  while  g^^^ 
has  a  threshold  on  Tij{n). 

We  now  recap  the  results  which  show  logarithmic  regret  in 
learning  the  best  channel.  In  this  context,  we  define  uniformly 
good  policies  p  [8]  as  those  with  regret 


R{n;  /L6,  U,  p)  =  o(n"),    Va  >  0,  /l^  G  (0, 1)^ 


(6) 
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Theorem  1  (Logarithmic  Regret  for  U  =  1  [10],  [11]): 
For  any  uniformly  good  policy  p  satisfying  (6),  the  expected 
time  spent  in  any  suboptimal  channel  z  7^  1*  satisfies 


lim  P 


^    /  N      (1  —  logn 


=  1, 


(7) 


where  1*  is  the  channel  with  the  best  availability.  Hence,  the 
regret  satisfies 

A(l*,i) 


E 

iGl-worst 


(8) 


The  regret  under  the  ^^^^  statistic  in  (5)  achieves  the  above 
bound. 


lim 

n^co 


logn 


iGl-worst 

The  regret  under  g^^^^  statistic  in  (34)  satisfies 

8  logn 


Algorithm  2  Centralized  Learning  Policy  p^^^^  in  [9]. 

Input:  A'^  :=  U^^^^  uf^^  Xfj  :  Channel  availability  after  n 

slots,  g(n):  statistic  based  on  A'^, 

cr(a;  g(n)):  index  of  a*  highest  entry  in  g(n). 

Init:  Sense  in  each  channel  once,  n  ^  C 

Loop:  n  ^  n  +  1 

Curr_Sel  ^  channels  with  [/-best  entries  in  g(n).  If  free, 
transmit. 


Notice  that  for  both  the  single  user  case  U  =  1  (Theorem  1) 
and  the  centralized  multi-user  case  (Theorem  2),  the  number 
of  time  slots  spent  in  the  U- worst  channels  is  9 (logn)  and 
hence,  the  regret  is  also  6 (logn). 


IV.  Main  Results 

Armed  with  the  classical  results  on  multi-armed  bandits,  we 
now  design  distributed  learning  and  allocation  policies. 


B.  Centralized  Learning  &  Access  for  Multiple  Users 

We  now  consider  multiple  secondary  users  under  centralized 
access  policies  where  there  is  joint  learning  and  access  by  a 
central  agent  on  behalf  of  all  the  U  users.  Here,  to  minimize 
the  sum  regret,  the  centralized  policy  allocates  the  U  users  to 
orthogonal  channels  to  avoid  collisions.  Let  p^^^^{X^),  with 
:=  -^ti'  ^^^^te  a  centralized  policy  based 

on  the  sensing  variables  of  all  the  users.  The  policy  under 
centralized  learning  is  a  simple  generalization  of  the  single- 
user  policy  and  is  given  in  Algorithm  2.  We  now  recap  the 
results  of  [9]. 

Theorem  2  (Regret  Under  Centralized  Policy  p^^^^  [9]): 
For  any  uniformly  good  centralized  policy  p^^^^  satisfying 
(6),  the  expected  times  spent  in  a  /7 -worst  channel  i  satisfies 


lim 


(1  —  e)  logn 


(10) 


where  /7*  is  the  channel  with  the  /7*  best  availability.  Hence, 
the  regret  satisfies 


lim  inf 


logn 


> 


y  A{U*,i) 


(11) 


The  scheme  in  Algorithm  2  based  on  achieves  the  above 
bound. 


lim 

n^oo 


J?(n;Ai,l,p^^^^(g°") 
logn 


=  E 


A{U*,i) 


i^U -worst 


D{pi,pu* 


(12) 


The  scheme  in  Algorithm  2  based  on  the 
any  n  >  0, 


satisfies  for 


< 


u 

E 

m=l  iG  C/-worst  k=l 


E  E 


A(m*,i) 


U 


\  logn 


A(m*,z)2 


21 


Proof:     See  Appendix  A. 


(13) 

□ 


A.  Preliminaries:  Bounds  on  Regret 

We  first  provide  simple  bounds  on  the  regret  in  (3)  for  any 
distributed  learning  and  access  policy  p. 

Proposition  1  (Lower  and  Upper  Bounds  on  Regret):  The 
regret  under  any  distributed  policy  p  satisfies 

u 

R{n;p)>Y^  A([/*,z)E[T,,,(n)],  (14) 

j=l  iGf/-worst 
'  U 

R{n',p)<p{r)  ^E[T,,,(n)]+E[M(n)]  ,  (15) 

j=l  iGt/-worst 

where  Tij{n)  is  the  number  of  slots  where  user  j  selects 
channel  i  for  sensing,  M{n)  is  the  number  of  collisions  faced 
by  the  users  in  the  /7-best  channels  in  n  slots,  A(i,  j)  = 
p{i)  —  p{j)  and  /i(l*)  is  the  highest  mean  availability. 
Proof:     See  Appendix  B.  □ 

In  the  subsequent  sections,  we  propose  distributed  learning 
and  access  policies  and  provide  regret  guarantees  for  the 
policies  using  the  upper  bound  in  (15).  The  lower  bound  in 
(14)  can  be  used  to  derive  lower  bound  on  regret  for  any 
uniformly-good  policy. 

The  first  term  in  (15)  represents  the  lost  transmission 
opportunities  due  to  selection  of  U -worst  channels  (with  lower 
mean  availabilities),  while  the  second  term  represents  the 
performance  loss  due  to  collisions  among  the  users  in  the 
/7-best  channels.  The  first  term  in  (15)  decouples  among  the 
different  users  and  can  be  analyzed  solely  through  the  marginal 
distributions  of  the  ^-statistics  at  the  users.  This  in  turn,  can 
be  analyzed  by  manipulating  the  classical  results  on  multi- 
armed  bandits  [10],  [11].  On  the  other  hand,  the  second  term 
in  (15),  involving  collisions  in  the  [/-best  channels,  requires 
the  joint  distribution  of  the  ^-statistics  at  different  users  which 
are  correlated  variables.  This  is  intractable  to  analyze  directly 
and  we  develop  techniques  to  bound  this  term. 
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Algorithm  3  Policy  C,  gj(n))  for  each  user  j  under 

U  users,  C  channels  and  statistic  gj{n). 

Input:  {Xij{n)}i=i^,,,^c  '  Sample-mean  availabilities  at 
user  j  after  n  rounds,  gj{i;n)\  statistic  based  on  Xij{n), 
a{a;  gj{n))\  index  of  a*  highest  entry  in  gj{n). 

n):  indicator  of  collision  at      slot  at  channel  i 
Init:  Sense  in  each  channel  once,  n  ^  C,  Curr_Rank  ^  1, 

Loop:  n  ^  n  +  1 
if  (j{Curr_Sel]  n  —  1)  =  1  then 
Draw  a  new  Curr_Rank  ~  Unif([/) 
end  if 

Select  channel  for  sensing.  If  free,  transmit. 

Curr_Sel  ^  a{Curr_Rank;gj{n)). 

If  collision  (j{Curr_Sel;m)  ^  1,  Else  0. 


B.         :  Distributed  Learning  and  Access 

We  present  the  p^^^^  policy  in  Algorithm  3.  Before  de- 
scribing this  policy,  we  make  some  simple  observations.  If 
each  user  implemented  the  single-user  policy  in  Algorithm  1 , 
then  it  would  result  in  collisions,  since  all  the  users  target 
the  best  channel.  When  there  are  multiple  users  and  there 
is  no  direct  communication  among  them,  the  users  need  to 
randomize  channel  access  in  order  to  avoid  collisions.  At 
the  same  time,  accessing  the  /7-worst  channels  needs  to  be 
avoided  since  they  contribute  to  regret.  Hence,  users  can  avoid 
collisions  by  randomizing  access  over  the  /7-best  channels, 
based  on  their  estimates  of  the  channel  ranks.  However,  if  the 
users  randomize  in  every  slot,  there  is  a  finite  probability  of 
collisions  in  every  slot  and  this  results  in  a  linear  growth  of 
regret  with  the  number  of  time  slots.  Hence,  the  users  need 
to  converge  to  a  collision-free  configuration  to  ensure  that  the 
regret  is  logarithmic. 

In  Algorithm  3,  there  is  adaptive  randomization  based 
on  feedback  regarding  the  previous  transmission.  Each  user 
randomizes  only  if  there  is  a  collision  in  the  previous  slot; 
otherwise,  the  previously  generated  random  rank  for  the  user 
is  retained.  The  estimation  for  the  channel  ranks  is  through 
the  ^-statistic,  along  the  lines  of  the  single-user  case. 


C.  Regret  Bounds  under  p^^ 

It  is  easy  to  see  that  the  p^^^^  policy  ensures  that  the 
users  are  allocated  orthogonally  to  the  /7-best  channels  as 
the  number  of  transmission  slots  goes  to  infinity.  The  regret 
bounds  on  p^^^^  are  however  not  immediately  clear  and  we 
provide  guarantees  below. 

We  first  provide  a  logarithmic  upper  bound^  on  the  number 
of  slots  spent  by  each  user  in  any  U -worst  channel.  Hence,  the 
first  term  in  the  bound  on  regret  in  (15)  is  also  logarithmic. 

Lemma  1  (Time  Spent  in  U -worst  Channels):  Under  the 
^RAND  scheme  in  Algorithm  3,  the  total  time  spent  by  any  user 

^Note  that  the  bound  on  E[Ti^j(n)]  in  (16)  holds  for  user  j  even  if  the 
other  users  are  using  a  policy  other  than  p^^^^.  But  on  the  other  hand,  to 
analyze  the  number  of  collisions  E[M(n)]  in  (19),  we  need  every  user  to 
implement  p^^^°. 


j  =  1, ...,[/,  in  any  i  ^  U -worst  channel  is  given  by 


k=i 


8  logn 
A(z,A:*)2 


y 


(16) 


Proof:  The  proof  is  similar  along  the  lines  of  Theorem  2, 
given  in  Appendix  A.  The  only  difference  is  that  here,  we 
need  to  consider  the  probability  of  transmission  of  each  user 
in  each  U -worst  channel  while  in  Appendix  A,  the  probability 
of  transmission  by  all  users  is  considered.  □ 
We  now  focus  on  analyzing  the  number  of  collisions  M(n) 
in  the  U -best  channels.  We  first  give  a  result  on  the  expected 
number  of  collisions  in  the  ideal  scenario  where  each  user  has 
perfect  knowledge  of  the  channel  availability  statistics  ii.  In 
this  case,  the  users  attempt  to  reach  an  orthogonal  (collision- 
free)  configuration  by  uniformly  randomizing  over  the  U -best 
channels. 

The  stochastic  process  in  this  case  is  a  finite-state  Markov 
chain.  A  state  in  this  Markov  chain  corresponds  to  a  con- 
figuration of  U  number  of  (identical)  users  in  U  number  of 
channels.  The  number  of  states  in  the  Markov  chain  is  the 
number  of  compositions  of  given  by  ('^^-^)  [24,Thm.5.1]. 
The  orthogonal  configuration  corresponds  to  the  absorbing 
state.  There  are  two  other  classes  of  states  in  the  Markov 
chain.  One  class  consists  of  states  in  which  each  channel  either 
has  more  than  one  user  or  no  user  at  all.  The  second  class 
consists  of  the  remaining  states  where  some  channels  may 
have  just  one  user.  For  the  first  class,  the  transition  probability 
to  any  state  of  the  Markov  chain  (including  self  transition  and 
absorption  probabilities)  is  uniform.  For  a  state  in  the  second 
class,  where  certain  channels  have  exactly  one  user,  there  are 
only  transitions  to  states  which  have  identical  configuration  of 
the  single  users  and  the  transition  probabilities  are  uniform. 
Let  T(U^U)  denote  the  maximum  time  to  absorption  in  the 
above  Markov  chain  starting  from  any  initial  distribution.  We 
have  the  following  result 

Lemma  2  (No.  of  Collisions  Under  Perfect  Knowledge): 
The  expected  number  of  collisions  under  p^^^^  scheme  in 
Algorithm  3,  assuming  that  each  user  has  perfect  knowledge 
of  the  mean  channel  availabilities  /j,,  is  given  by 


E[M(n);/ 


\U,C,^)]<UE[r{U,U)] 


<  U 


U 


(17) 


Proof:  See  Appendix  C.  □ 
The  above  result  states  that  there  is  at  most  a  finite  number 
of  expected  collisions,  bounded  by  UE[T{U^  U)]  under  perfect 
knowledge  of  fi.  In  contrast,  recall  from  the  previous  section, 
that  there  are  no  collisions  under  perfect  knowledge  of  fi 
in  the  presence  of  pre-allocated  ranks.  Hence,  /7E[T(/7,  U)] 
represents  a  bound  on  the  additional  regret  due  to  the  lack 
of  direct  communication  among  the  users  to  negotiate  their 
ranks. 

We  use  the  result  of  Lemma  2  for  analyzing  the  num- 
ber of  collisions  under  distributed  learning  of  the  unknown 
availabilities  /jL  as  follows:  if  we  show  that  the  users  are 
able  to  learn  the  correct  order  of  the  different  channels  with 
only  logarithmic  regret  then  only  an  additional  finite  expected 
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number  of  collisions  occur  before  reaching  an  orthogonal 
configuration. 

Define  T'(n;  as  the  number  of  slots  where  any  one 

of  the  top  U -estimated  ranks  of  the  channels  at  some  user  is 
wrong  under  the  p^^^^  policy.  Below  we  prove  that  its  expected 
value  is  logarithmic  in  the  number  of  transmissions. 

Lemma  3  (Wrong  Order  of  g -statistics):  Under  the  p^^^^ 
scheme  in  Algorithm  3, 


c 


< 


a—l  5=a+l 


8  logn 


A(a*,6*)2  '  3 


.(18) 


Proof:  See  Appendix  D.  □ 
We  now  provide  an  upper  bound  on  the  number  of  collisions 
M{n)  in  the  U -best  channels  by  incorporating  the  above  result 
on  E[T'(n)],  the  result  on  the  average  number  of  slots  IE[Ti^j] 
spent  in  the  i  G  /7 -worst  channels  in  Lemma  1  and  the  average 
number  of  collisions  [/E[T(/7,  U)]  under  perfect  knowledge 
of  II  in  Lemma  2. 

Theorem  3  (Logarithmic  Number  of  Collisions  Under  p'^^): 
The  expected  number  of  collisions  in  the  [/-best  channels 
under  /^^°(/7,  C,  g''^^^)  scheme  satisfies 

E[M(n)]  <  U{E[T{U,U)]  +  l)E[T^{n)].  (19) 

Hence,  from  (16),  (18)  and  (17),  M(n)  =  O(logn). 
Proof:  See  Appendix  E.  n 
Hence,  there  are  only  a  logarithmic  number  of  expected 
collisions  before  the  users  settle  in  the  orthogonal  channels. 
Combining  this  result  with  Lemma  1  that  the  number  of 
slots  spent  in  the  [/-worst  channels  is  also  logarithmic,  we 
immediately  have  one  of  the  main  results  of  this  paper 
that  the  sum  regret  under  distributed  learning  and  access  is 
logarithmic. 

Theorem  4  (Logarithmic  Regret  Under  p^^^):  The  policy 
^RAND(^^^^  gMEAN)  in  AlgoHthm  3  has  6 (logn)  regret. 
Proof:     Substituting  (19)  and  (16)  in  (15).  □ 

Hence,  we  prove  that  distributed  learning  and  channel 
access  among  multiple  secondary  users  is  possible  with  log- 
arithmic regret  without  any  explicit  communication  among 
the  users.  This  implies  that  the  number  of  lost  opportunities 
for  successful  transmissions  at  all  secondary  users  is  only 
logarithmic  in  the  number  of  transmissions,  which  is  negligible 
when  there  are  a  large  number  of  transmissions. 

We  have  so  far  focused  on  designing  schemes  that  maximize 
system  or  social  throughput.  We  now  briefly  discuss  the 
fairness  for  an  individual  user  under  Since  p^^^^  does 

not  distinguish  any  of  the  users,  in  the  sense  that  each  user 
has  equal  probability  of  "settling"  down  in  one  of  the  /7-best 
channels  while  experiencing  only  logarithmic  regret  in  doing 
so.  Simulations  in  Section  VII  (in  Fig.  4)  demonstrate  this 
phenomenon. 

We  also  note  that  the  bound  on  regret  under  p^^^^  grows 
rapidly  with  the  number  of  users  U ,  due  to  the  bound  in 
(17).  This  is  due  to  uniform  channel  access  by  the  users 
without  any  coordination  or  information  exchange.  It  is  of 
interest  to  explore  better  distributed  access  schemes,  which 
when  combined  with  learning  algorithms  yield  low  regret  in 
the  number  of  users. 


Algorithm  4  Policy  p^^^(n,  C,  gj (m),  ^)  for  each  user  j  under 
n  transmission  slots  (horizon  length),  C  channels,  statistic 
gj{m)  and  threshold  function  ^. 

1)  Input:  {Xij{n)}i=i^,,,^c  •  Sample-mean  availabilities 
at  user  j,  gj{i;  n):  statistic  based  on  X^^j(n), 
cr(a;gj(n)):  index  of  a*  highest  entry  in  gj(n). 

^  (i;  n):  indicator  of  collision  at      slot  at  channel  i 
U :  current  estimate  of  the  number  of  users, 
n:  horizon  (total  number  of  slots  for  transmission) 
^(n;  k)\  threshold  functions  for    =  1, . . . ,  C 

2)  Init:  Sense  each  channel  once,  m  ^  C,  Curr_Sel  ^ 
C  Curr_Rank  ^  1,  /7  ^  1,  (j{i;m)  ^  0  for  all 
i  =  l,...,C 

3)  Loop  4  to  6:  m  ^  m  + 1,  stop  when  m  =  n  or  U  =  C. 

4)  If  Cj{Curr_Sel;  m  -  1)  =  1  then^ 

Draw  a  new  Curr_Rank  ~  Unif(/7).  end  if 
Select  channel  for  sensing.  If  free,  transmit. 

Curr_Sel  ^  a{Curr_Rank;gj{m)) 

5)  (j{Curr_Sel;m)  ^  1  if  collision,  0  o.w. 

6)  If  Er^i  ELi  Ci(<T(A:;  g,(m));  a)  >  ^{n;  U))  then 

U  ^  U  +  1,  Cj{i',a)  ^  0,  i  =  1, . . .  C,  a  =  1, . . . ,  m. 
end  if 


V.  Distributed  Learning  and  Access  under 
Unknown  Number  of  Users 

We  have  so  far  assumed  that  the  number  of  secondary 
users  is  known,  and  is  required  for  the  implementation  of 
the  p^^^^  policy.  In  practice,  this  entails  initial  announcement 
from  each  of  the  secondary  users  to  indicate  their  presence  in 
the  cognitive  network.  However,  in  a  truly  distributed  setting 
without  any  information  exchange  among  the  users,  such  an 
announcement  may  not  be  possible. 

In  this  section,  we  consider  the  scenario,  where  the  number 
of  users  U  is  unknown  (but  fixed  throughout  the  duration  of 
transmissions  and  U  <  C,  the  number  of  channels).  In  this 
case,  the  policy  needs  to  estimate  the  number  of  secondary 
users  in  the  system,  in  addition  to  learning  the  channel 
availability  statistics  and  designing  channel  access  rules  based 
on  collision  feedback.  Note  that  if  the  policy  assumed  the 
worst-case  scenario  that  U  =  C,  then  the  regret  grows  linearly 
since  there  is  a  positive  probability  that  the  U -worst  channels 
are  selected  for  sensing  in  any  time  slot. 

A.  Description  of  p^^^  Policy 

We  now  propose  a  policy  p^^^  in  Algorithm  4.  This  policy 
incorporates  two  broad  aspects  in  each  transmission  slot,  viz., 
execution  of  the  p^^^^  policy  in  Algorithm  3,  based  on  the 
current  estimate  of  the  number  of  users  /7,  and  updating  of 
the  estimate  U  based  on  the  number  of  collisions  experienced 
by  the  user. 

The  updating  is  based  on  the  idea  that  if  there  is  under- 
estimation of  U  at  all  the  users  (Uj  <  U  at  all  the  users  j), 
collisions  necessarily  build  up  and  the  collision  count  serves 
as  a  criterion  for  incrementing  U.  This  is  because  after  a  long 
learning  period,  the  users  learn  the  true  ranks  of  the  channels, 
and  target  the  same  set  of  channels.  However,  when  there  is 
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under-estimation,  the  number  of  users  exceeds  the  number  of 
channels  targeted  by  the  users.  Hence,  collisions  among  the 
users  accumulate,  and  can  be  used  as  a  test  for  incrementing 
U. 

Denote  the  collision  count  used  by  the  p^^^  policy  as 


thresholds  ^(n)  to  test  against  the  collision  counts  obtained 
through  feedback. 

Conditional  Regret:  We  now  give  the  result  for  the  first 
part.  Define  the  "good  event"  C{n;U)  that  none  of  the  users 
over-estimates  U  under  p^^^  as 


*fc,iM  ■■=  ^^0K6;g,(m));a).  (20) 

a=l 6=1 

which  is  the  total  number  of  collisions  experienced  by  user  j 
so  far  (till  the  transmission  slot)  in  the  top  -channels, 
where  the  ranks  of  the  channels  are  estimated  using  the  g- 
statistics.  The  collision  count  is  tested  against  a  threshold 
^{n;Uj),  which  is  a  function  of  the  horizon  length^  and 
current  estimate  Uj.  When  the  threshold  is  exceeded,  Uj  is 
incremented,  and  the  collision  samples  collected  so  far  are 
discarded  (by  setting  them  to  zero)  (line  6  in  Algorithm  4). 
The  choice  of  the  threshold  for  incrementing  Uj  is  critical;  if 
it  is  too  small  it  can  result  in  over-estimation  of  the  number 
of  users.  On  the  other  hand,  if  it  is  too  large,  it  can  result  in 
slow  learning  and  large  regret.  Threshold  selection  is  studied 
in  detail  in  the  subsequent  section. 

B.  Regret  Bounds  under  p^^^ 

We  analyze  regret  bounds  under  the  p^^^  policy,  where  the 
regret  is  defined  in  (3).  Let  the  maximum  threshold  function 
for  the  number  of  consecutive  collisions  under  the  p^^^  policy 
be  denoted  by 


ein;U):-- 


-    max  ^(n:k). 


(21) 


We  prove  that  the  p^^^  policy  has  0(^*(n;/7))  regret  when 
^*(n;  U)  =  cc;(logn),  and  where  n  is  the  number  of  transmis- 
sion slots. 

The  proof  for  the  regret  bound  under  the  p^^^  policy  consists 
of  two  main  parts:  we  prove  bounds  on  regret  conditioned 
on  the  event  that  none  of  the  users  over-estimate  U.  Second, 
we  show  that  the  probability  of  over-estimation  at  any  of  the 
users  goes  to  zero  asymptotically  in  the  horizon  length.  Taken 
together,  we  obtain  the  regret  bound  for  the  p^^^  policy. 

Note  that  in  order  to  have  small  regret,  it  is  crucial  that  none 
of  the  users  over-estimate  U.  This  is  because  when  there  is 
over-estimation,  there  is  a  finite  probability  of  selecting  the 
/7-worst  channels  even  upon  learning  the  true  ranks  of  the 
channels.  Note  that  regret  is  incurred  whenever  a  U- worst 
channel  is  selected  since  under  perfect  knowledge  this  channel 
would  not  be  selected.  Hence,  under  over-estimation,  the  regret 
grows  linearly  in  the  number  of  transmissions. 

In  a  nutshell,  under  the  p^^^  policy,  the  decision  to  increment 
the  estimate  U  reduces  to  a  hypothesis-testing  problem  with 
hypotheses  Ho-  number  of  users  is  less  than  or  equal  to  the 
current  estimate  and  Hi',  number  of  users  is  greater  than 
the  current  estimate.  In  order  to  have  a  sub-linear  regret, 
the  false-alarm  probability  (deciding  Hi  under  Ho)  needs  to 
decay  asymptotically.  This  is  ensured  by  selecting  appropriate 

^In  this  section,  we  assume  that  the  users  are  aware  of  the  horizon  length 
n  for  transmission.  Note  that  this  is  not  a  limitation  and  can  be  extended 
to  case  of  unknown  horizon  length  as  follows:  implement  the  algorithm  by 
fixing  horizon  lengths  to  no ,  2no ,  4no  ...  for  a  fixed  no  G  N  and  discarding 
estimates  from  previous  stages. 


C{n;U):={f]U]^\n)<U}. 


(22) 


The  regret  conditioned  on  C{n;U),  denoted  by 
R{n;     U,  p^'^)|C(n;  U),  is  given  by 


c  u 


'  E  /^(^*)  -EE  M(i)E[F,j(n)|C(n;  [/)], 


k=l 


=  1  j  =  l 


where  Vij{n)  is  the  number  of  times  that  user  j  is  the  sole 
user  of  channel  i.  Similarly,  we  have  conditional  expectations 
of  E[Tij{n)\C{n;  U)]  and  of  the  number  of  collisions  in  U- 
best  channels,  given  by  E[M{n)\C{n;  U)].  We  now  show  that 
the  regret  conditioned  on  C(n;  U)  is  0(max(^*(n;  /7),  logn)). 

Lemma  4:  (Conditional  Regret):  When  all  the  U  secondary 
users  implement  the  p^^^  policy,  we  have  for  ail  i  E  U -worst 
channel  and  each  user  j  =  !,...,[/, 


E[Tij{n)\C{n)]  <  ^ 


k=l 


8  logn 
A(z,fc*)^ 


1 


TT 

y 


(23) 


The  conditional  expectation  on  number  of  collisions  M{n)  in 
the  U -best  channel  satisfies 

u 

E[M{n)\C{n;U)]  <  U^^{n;k)  <  U^C{n;U).  (24) 

k=l 

From  (15),  we  have  R{n)\C{n;  U)  is  0(max(^*(n;  [/),  logn)) 
for  any  n  G  N. 

Proof:  See  Appendix  F.  □ 
Probability  of  Over-estimation:  We  now  prove  that  none 
of  the  users  over-estimates^  U  under  the  p^^^  policy,  i.e.,  the 
probability  of  the  event  C(n;  U)  in  (22)  approaches  one  as 
n  ^  oo,  when  the  thresholds  ^{n;U)  for  testing  against 
the  collision  count  are  chosen  appropriately  (see  line  6  in 
Algorithm  4).  Trivially,  we  can  set  ^(n;  1)  =  1  since  a  single 
collision  is  enough  to  indicate  that  there  is  more  than  one  user. 
For  any  other  /c  >  1,  we  choose  function  ^  satisfying 


^(n; /c)  =  cc;(logn),    V/c  >  1. 


(25) 


We  prove  that  the  above  condition  ensures  that  over-estimation 
does  not  occur. 

Recall  that  T'{n;  p^^^)  is  the  number  of  slots  where  any 
one  of  the  top  [/-estimated  ranks  of  the  channels  at  some 
user  is  wrong  under  the  p^^^  policy.  We  show  that  E[T'(n)]  is 
O(logn). 

Lemma  5  (Time  spent  with  wrong  estimates):  The 
expected  number  of  slots  where  any  of  the  top  [/-estimated 

^Note  that  the  p^^^  policy  automatically  ensures  that  all  the  users  do  not 
under-estimate  U,  since  it  increments  U  based  on  collision  estimate.  This 
implies  that  the  probability  of  the  event  that  all  the  users  under-estimate  U 
goes  to  zero  asymptotically. 
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ranks  of  the  channels  at  any  user  is  wrong  under  the 
policy  satisfies 

E[T'{n)]<uf2  E 


0=1  b=a-\-l 


8  logn 


A(a*,6*)2 


-IH  

3 


(26) 


□ 


Proof:     The  proof  is  on  the  lines  of  Lemma  3 

Recall  the  definition  of  T(/7,  /7)  in  the  previous  section, 
as  the  maximum  time  to  absorption  starting  from  any  initial 
distribution  of  the  finite- state  Markov  chain,  where  the  states 
correspond  to  different  user  configurations  and  the  absorbing 
state  corresponds  to  the  collision-free  configuration.  We  now 
generalize  the  definition  to  T(/7,  /c),  as  the  time  to  absorption 
in  a  new  Markov  chain,  where  the  state  space  is  the  set  of 
configurations  of  U  users  in  k  channels,  and  the  transition 
probabilities  are  defined  on  similar  lines.  Note  that  T(U,k) 
is  almost-surely  finite  when  k  >  U  and  00  otherwise  (since 
there  is  no  absorbing  state  in  the  latter  case). 

We  now  bound  the  maximum  value  of  the  collision  count 
^kji'm)  under  the  p^^^  policy  in  (20)  using  T'{m),  the  total 
time  spent  with  wrong  channel  estimates,  and  T{U,k),  the 

St 

time  to  absorption  in  the  Markov  chain.  Let  <  denote  the 
stochastic  order  for  two  random  variables  [25]. 

Proposition  2:  The  maximum  collision  count  in  (20)  over 
all  users  under  the  p^^^  policy  satisfies 

max   ^k,j{rn)  <  {T\m)  +  l)T(/7,  k),    Vm  G  N.  (27) 

j=l,...,U 

Proof:  The  proof  is  on  the  lines  of  Theorem  3.  See  Ap- 
pendix G.  □ 

We  now  prove  that  the  probability  of  over-estimation  goes 
to  zero  asymptotically. 

Lemma  6  (No  Over-estimation  Under  p^^^):  For  threshold 
function  ^  satisfying  (25),  the  event  C{n;U)  in  (22)  satisfies 


lim  ¥[C{n-U)]  =  1, 


(28) 


and  hence,  none  of  the  users  over-estimates  U  under  the  p^^^ 
policy. 

Proof:     See  Appendix  H.  n 
We  now  give  the  main  result  of  this  section  that  p^^^  has 
slightly  more  than  logarithmic  regret  asymptotically  and  this 
depends  on  the  threshold  function  ^*(n;  U)  in  (21). 

Theorem  5  (Asymptotic  Regret  Under  p^^^):  With 
threshold  function  ^   satisfying  conditions  in   (25),  the 
policy  p^^^{n,C,gj{m),(_)  in  Algorithm  4  satisfies 


R{n;fi,U,p^'') 
limsup  — — —        <  00. 


(29) 


e(n;/7) 

Proof:  From  Lemma  4  and  Lemma  6.  □ 
Hence,  the  regret  under  the  proposed  p^^^  policy  is 
0(^*(n;[/))  under  fully  decentralized  setting  without  the 
knowledge  of  number  of  users  when  ^*{n;U)  =  cj(logn). 
Hence,  0(/(n)logn)  regret  is  achievable  for  all  functions 
/(n)  ^  00  as  n  ^  00.  The  question  of  whether  logarithmic 
regret  is  possible  under  unknown  number  of  users  is  of 
interest. 

Note  the  difference  between  the  p^^^  policy  in  Algorithm  4 
under  unknown  number  of  users  with  the  p^^^^  policy  with 
known  number  of  users  in  Algorithm  3.  The  regret  under  p^^^ 


is  0(/(n)logn)  for  any  function  /(n)  =  cc;(l),  while  it  is 
0(log  n)  under  the  p^^^^  policy.  Hence,  we  are  able  to  quantify 
the  degradation  of  performance  when  the  number  of  users  is 
unknown. 

VL  Lower  Bound  &  Effect  of  Number  of  Users 
A.  Lower  Bound  For  Distributed  Learning  &  access 

We  have  so  far  designed  distributed  learning  and  access 
policies  with  provable  bounds  on  regret.  We  now  discuss  the 
relative  performance  of  these  policies,  compared  to  the  optimal 
learning  and  access  policies.  This  is  accomplished  by  noting 
a  lower  bound  on  regret  for  any  uniformly- good  policy,  first 
derived  in  [4]  for  a  general  class  of  uniformly-good  time- 
division  policies.  We  restate  the  result  below. 

Theorem  6  (Lower  Bound  [4]):  For  any  uniformly  good 
distributed  learning  and  access  policy  p,  the  sum  regret  in 
(2)  satisfies 


n^oo  logn 


E 


iGC/-worst  j 


1  D{pi,Pr)' 


(30) 


The  lower  bound  derived  in  [9]  for  centralized  learning  and 
access  holds  for  distributed  learning  and  access  considered 
here.  But  a  better  lower  bound  is  obtained  above  by  consid- 
ering the  distributed  nature  of  learning.  The  lower  bound  for 
distributed  policies  is  worse  than  the  bound  for  the  centralized 
policies  in  (1 1).  This  is  because  each  user  independently  learns 
the  channel  availabilities  /l^  in  a  distributed  policy,  whereas 
sensing  decisions  from  all  the  users  are  used  for  learning  in  a 
centralized  policy. 

Our  distributed  learning  and  access  policy  p^^^^  matches 
the  lower  bound  on  regret  in  (15)  in  the  order  (logn)  but  the 
scaling  factors  are  different.  It  is  not  clear  if  the  regret  lower 
bound  in  (30)  can  be  achieved  by  any  policy  under  no  explicit 
information  exchange  and  is  a  topic  for  future  investigation. 

B.  Behavior  with  Number  of  Users 

We  have  so  far  analyzed  the  sum  regret  under  our  policies 
under  a  fixed  number  of  users  U.  We  now  analyze  the  behavior 
of  regret  growth  as  U  increases  while  keeping  the  number  of 
channels  C  >  U  fixed. 

Theorem  7  (Varying  Number  of  Users):  When  the  number 
of  channels  C  is  fixed  and  the  number  of  users  U  <  C  is 
varied,  the  sum  regret  under  centralized  learning  and  access 
^cENT  Q2)  decreases  as  U  increases  while  the  upper  bounds 
on  the  sum  regret  under  p^^^^  in  (15)  monotonically  increases 
with  U. 

Proof:  The  proof  involves  analysis  of  (12)  and  (15).  To 
prove  that  the  sum  regret  under  centralized  learning  and  access 
in  (12)  decreases  with  the  number  of  users  U,  it  suffices  to 
show  that  for  i  G  U -worst  channel, 

A(?7*,z) 

D{pi,pu*) 

decreases  as  U  increases.  Note  that  /i(/7*)  and  D{pi,pu*) 
decrease  as  U  increases.  Hence,  it  suffices  to  show  that 
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decreases  with  U.  This  is  true  since  its  derivative  with  respect 
to  U  is  negative. 

For  the  upper  bound  on  regret  under  p^^^^  in  (15),  when  U 
is  increased,  the  number  of  [/-worst  channels  decreases  and 
hence,  the  first  term  in  (15)  decreases.  However,  the  second 
term  consisting  of  collisions  M(n)  increases  to  a  far  greater 
extent.  □ 

Note  that  the  above  results  are  for  the  upper  bound  on  regret 
under  the  p^^^^  policy  and  not  the  regret  itself.  Simulations  in 
Section  VII  reveal  that  the  actual  regret  also  increases  with  U. 
Under  the  centralized  scheme  p^^^^,  as  U  increases,  the  number 
of  [/-worst  channels  decreases.  Hence,  the  regret  decreases, 
since  there  are  less  number  of  possibilities  of  making  bad 
decisions.  However,  for  distributed  schemes  although  this 
effect  exists,  it  is  far  outweighed  by  the  increase  in  regret 
due  to  the  increase  in  collisions  among  the  U  users. 

In  contrast,  the  distributed  lower  bound  in  (30)  displays 
anomalous  behavior  with  U  since  it  fails  to  account  for 
collisions  among  the  users.  Here,  as  U  increases  there  are 
two  competing  effects:  a  decrease  in  regret  due  to  decrease 
in  the  number  of  U -worst  channels  and  an  increase  in  regret 
due  to  increase  in  the  number  of  users  visiting  these  U -worst 
channels. 

VII.  Numerical  Results 

We  present  simulations  for  the  algorithms  developed  in 
the  paper,  varying  the  number  of  users  and  channels  to 
verify  the  performance  of  the  algorithms  detailed  earlier.  We 
consider  C=  9  channels  (or  a  subset  of  them  when  the  num- 
ber of  channels  is  varying)  with  probabilities  of  availability 
characterized  by  Bernoulli  distributions  with  evenly  spaced 
parameters  ranging  from  0.1  to  0.9. 

Comparison  of  Different  Schemes:  Fig.  2a  compares  the 
regret  under  the  centralized  and  random  allocation  schemes  in 
a  scenario  with  U  =  4  cognitive  users  vying  for  access  to  the 
(7  =  9  channels.  The  theoretical  lower  bound  for  the  regret 
in  the  centralized  case  from  Theorem  2  and  the  distributed 
case  from  Theorem  6  are  also  plotted.  The  upper  bounds  on 
the  random  allocation  scheme  from  Theorem  4  is  not  plotted 
here,  since  the  bounds  are  loose  especially  as  the  number  of 
users  U  increases.  Finding  tight  upper  bounds  is  a  subject  of 
future  study. 

As  expected,  centralized  allocation  has  the  least  regret. 
Another  important  observation  is  the  gap  between  the  lower 
bounds  on  the  regret  and  the  actual  regret  in  both  the  dis- 
tributed and  the  centralized  cases.  In  the  centralized  scenario, 
this  is  simply  due  to  using  the  g^^^^  statistic  in  (34)  instead 
of  the  optimal  g^^^  statistic  in  (5).  However,  in  the  distributed 
case,  there  is  an  additional  gap  since  we  do  not  account  for 
collisions  among  the  users.  Hence,  the  schemes  under  con- 
sideration are  O(logn)  and  achieve  order  optimality  although 
they  are  not  optimal  in  the  scaling  constant. 

Performance  with  Varying  U  and  C:  Fig.  3a  explores  the 
impact  of  increasing  the  number  of  secondary  users  U  on  the 
regret  experienced  by  the  different  policies  with  the  number 
of  channels  C  fixed.  With  increasing  U,  the  regret  decreases 
for  the  centralized  schemes  and  increases  for  the  distributed 
schemes,  as  predicted  in  Theorem  7.  The  monotonic  increase 
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of  regret  under  random  allocation  p^^^^  is  a  result  of  the 
increase  in  the  collisions  as  U  increases.  On  the  other  hand, 
the  monotonic  decreasing  behavior  in  the  centralized  case  is 
because  as  the  number  of  users  increases,  the  number  of  U- 
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worst  channels  decreases  resulting  in  lower  regret.  Also,  the 
lower  bound  for  the  distributed  case  in  (30)  initially  increases 
and  then  decreases  with  U.  This  is  because  as  U  increases 
there  are  two  competing  effects:  decrease  in  regret  due  to 


decrease  in  number  of  U -worst  channels  and  increase  in  regret 
due  to  increase  in  number  of  users  visiting  these  /7-worst 
channels. 

Fig.  3b  evaluates  the  performance  of  the  different  algorithms 
as  the  number  of  channels  C  is  varied  while  fixing  the  number 
of  users  U.  The  probability  of  availability  of  each  additional 
channel  is  set  higher  than  those  already  present.  Here,  the 
regret  monotonically  increases  with  C  in  all  cases.  When  the 
number  of  channels  increases  along  with  the  quality  of  the 
channels,  the  regret  increases  as  a  result  of  an  increase  in  the 
number  of  U -worst  channels  as  well  as  the  increasing  gap  in 
quality  between  the  U -best  and  U -worst  channels. 

Also,  the  situation  where  the  ratio  U/C  is  fixed  to  be  0.5 
and  both  the  number  of  users  and  channels  along  with  their 
quality  increase  is  considered  in  Fig.  3c  and  we  find  that 
the  regret  grows  in  this  case  as  well.  Once  again,  this  is  in 
agreement  with  theory  since  the  number  of  U -worst  channels 
increases  as  U  and  C  increase  while  keeping  U/C  fixed. 

Collisions  and  Learning:  Fig.  2c  verifies  the  logarithmic 
nature  of  the  number  of  collisions  under  the  random  alloca- 
tion scheme  Additionally,  we  also  plot  the  number  of 
collisions  under  p^^^^  in  the  ideal  scenario  when  the  channel 
availability  statistics  /j.  are  known  to  see  the  effect  of  learning 
on  the  number  of  collisions.  The  low  value  of  the  number 
of  collisions  obtained  under  known  channel  parameters  in 
the  simulations  is  in  agreement  with  theoretical  predictions, 
analyzed  as  UE[T{U^U)]  in  Lemma  2.  As  the  number  of 
slots  n  increases,  the  gap  between  the  number  of  collisions 
under  the  known  and  unknown  parameters  increases  since  the 
former  converges  to  a  finite  constant  while  the  latter  grows  as 
0(log  n).  The  logarithmic  behavior  of  the  cumulative  number 
of  collisions  can  be  inferred  from  Fig.  2a.  However,  the  curve 
in  Fig.  2c  for  the  unknown  parameter  case  appears  linear  in 
n  due  to  the  small  value  of  n. 

Difference  between  g^^^  and  g^^^ :  Since  the  statistic  g^^^^ 
used  in  the  schemes  in  this  paper  differs  from  the  optimal 
statistic  g^^'^  in  (5),  a  simulation  is  done  to  compare  the  perfor- 
mance of  the  schemes  under  both  the  statistics.  As  expected,  in 
Fig.  2b,  the  optimal  scheme  has  better  performance.  However, 
the  use  of  g^^^^  enables  us  to  provide  finite-time  bounds,  as 
described  earlier. 

Fairness:  One  of  the  important  features  of  p^^^^  is  that 
it  does  not  favor  any  one  user  over  another.  Each  user  has 
an  equal  chance  of  settling  down  in  any  one  of  the  [/-best 
channels.  Fig.  4  evaluates  the  fairness  characteristics  of  p^^^^. 
The  simulation  assumes  [7  =  4  cognitive  users  vying  for 
access  to  C  =  9  channels.  The  graph  depicts  which  user 
asymptotically  gets  the  best  channel  over  1000  runs  of  the 
random  allocation  scheme.  As  can  be  seen,  each  user  has 
approximately  the  same  frequency  of  being  allotted  the  best 
channel  indicating  that  the  random  allocation  scheme  is  indeed 
fair. 

vni.  Conclusion 

In  this  paper,  we  proposed  novel  policies  for  distributed 
learning  of  channel  availability  statistics  and  channel  access 
of  multiple  secondary  users  in  a  cognitive  network.  The  first 
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Fig.    4.  Simulation    Results.    Probability    of   Availability    fx  = 

[0.1,  0.2, . . .  ,  0.9].  No.  of  slots  where  user  has  best  channel  vs.  user.  U  =  4, 
C  =  9,n  =  2500  slots,  1000  runs,  p^^^^. 


policy  assumed  that  the  number  of  secondary  users  in  the 
network  is  known,  while  the  second  policy  removed  this 
requirement.  We  provide  provable  guarantees  for  our  policies 
in  terms  of  sum  regret.  By  noting  the  lower  bound  on  regret  for 
any  uniformly-good  learning  and  access  policy,  we  find  that 
our  first  policy  achieves  order-optimal  regret  while  our  second 
policy  is  also  nearly  order  optimal.  Our  analysis  in  this  paper 
provides  insights  on  incorporating  learning  and  distributed 
medium  access  control  in  a  practical  cognitive  network. 

The  results  of  this  paper  open  up  an  interesting  array  of 
problems  for  future  investigation.  Simulations  suggest  that  our 
lower  and  upper  bounds  are  not  tight  in  terms  of  the  scaling 
constant  and  that  better  bounds  are  needed.  Our  assumptions 
of  an  i.i.d.  model  for  primary  user  transmissions  and  perfect 
sensing  at  the  secondary  users  need  to  be  relaxed.  Our  policy 
allows  for  an  unknown  but  fixed  number  of  secondary  users, 
and  it  is  of  interest  to  incorporate  users  dynamically  entering 
and  leaving  the  system.  Moreover,  our  model  ignores  dynamic 
traffic  at  the  secondary  nodes  and  extension  to  a  queueing- 
theoretic  formulation  is  desirable.  We  consider  the  worst-case 
scenario  that  there  is  no  information  exchange  among  the  sec- 
ondary users.  Extension  to  the  case  with  limited  information 
exchange  is  of  interest.  The  proposed  schemes  are  designed 
to  be  simple  and  tractable  for  mathematical  analysis.  It  is 
desirable  to  obtain  more  robust  schemes  with  additional  design 
considerations  for  real-world  deployment. 
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A.  Proof  of  Theorem  2 


The  result  in  (13)  involves  extending  the  results  of  [11, 
Thm.  1].  Define  Ti{n):=  Yl^=i  (^)  the  number  of  times 
a  channel  i  is  sensed  in  n  rounds  for  all  users.  We  will  show 
that 


m{n)]<  E 

keU-hcst 


_8bgrz_  TT^' 
A(/c*,i)2  3  . 


,    Vz  G  [/-worst. 

(31) 


We  have 


P[Tx.  in  i  in      slot]  =  P[^(/7*;  n)  <  g{i;  n)], 
=¥[A{i;n)n{g{U'';n)<g{i;n))] 
+  P[^^(z;n)n(^([/*;n)<^(z;n))], 

where 

A{i]n):=    y    {g{k;n)  <  g{i;n)) 

keU-hest 

is  the  event  that  at  least  one  of  the  /7-best  channels  has  g- 
statistic  less  than  i.  Hence,  from  union  bound  we  have 

F[A{i;n)]<    ^    F[g{k;n)  <  g{i;n)]. 

keU -best 

We  have  for  C  >U, 

P[^^(z;n)n(^([/*;n)<^(z;n))]=0, 

Hence, 

P[Tx.  in  i  in      round]  <   ^  P[^(^;     <  g{i]  n)]. 

feG-best 

On  the  lines  of  [11,  Thm.  1],  we  have  V/c,z  : 
k  is  U -best,  i  is  U -worst 


< 


1=1 


\  \ogn 
A(A:*,z)2 


Hence,  we  have  (31).  For  the  bound  on  regret,  we  can  break 
R  in  (2)  into  two  terms 

1  ^ 

iGt/-worst  1=1 

+  E  [^EA(r,i)]E[r,(n)]. 

ieU -best  1=1 

For  the  second  term,  we  have 


E  [^EA(r,i)]E[r,(n)] 

ieU -best  1=1 

<E[T*{n)]    J2  [^E^(^*'^) 


ieU -best  1=1 


=  0, 


where  T*(n):=  max  TAn).  Hence,  we  have  the  bound.  □ 

ieU -best 
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B.  Proof  of  Proposition  1 

For  convenience,  let  Ti{n)  :=  "^il^)  •= 

Sj=i  "^ijl^)-  Note  that  Xl^^i^i(^)  =  TiU^  since  each  user 
selects  one  channel  for  sensing  in  each  slot  and  there  are  U 
users.  From  (3), 

u  c 


R{n)  =nY,Kn  -Y.KmV^{n)i 

i=i  i=i 

<    ^  fi{i){n-E[V,{n)]) 

ieU -best 

<M(l*)(nf/-  n^ii^)]) 


ieU -b&st 

=M(l*)(E[M(n)]+    Yl  min)]), 

iGf/-worst 


(32) 
(33) 


where  Eqn.(32)  uses  the  fact  that  Vi{n)  <  n  since  total  number 
of  sole  occupancies  in  n  slots  of  channel  i  is  at  most  n,  and    T\n)  <  I  +      I[A{t^  /)], 
Eqn.(33)  uses  the  fact  that  M(n)  =  EiGC/-best(^*(^)  -  ^ii^))-  ^=2 

For  the  lower  bound,  since  each  user  selects  one  channel 
for  sensing  in  each  slot,  Xl^i  XljLi  =  Now 

T,,,(n)>T/,,,(n). 


Now  for  the  random  allocation  scheme  without  the  genie, 
any  user  not  experiencing  collision  does  not  draw  a  new 
variable  from  Unif(/7).  Hence,  the  number  of  possible  config- 
urations in  any  slot  is  lower  than  under  genie-aided  scheme. 
Since  there  is  only  one  configuration  satisfying  orthogonality^, 
the  probability  of  orthogonality  increases  in  the  absence  of  the 
genie  and  is  at  least  (35).  Hence,  the  number  of  slots  to  reach 
orthogonality  without  the  genie  is  at  most  (34).  Since  in  any 
slot,  at  most  U  collisions  occur,  (17)  holds.  □ 

D.  Proof  of  Lemma  3 

Let  Cn,rn'  =  y^^^' 

Case  1:  Consider  U  =  C  =  2  first.  Let 

^(t,0:={^r^(l*;t  -  1)  <  ^f^^(2*;t  -  l),T;(t  -  1)  > 
On  lines  of  [11,  Thm.  1], 


R{n;  II,  U,  p)  > 


1 

U 

u 


f;f;f;A([/*,i)E[T,,(n)] 

fc=l  i=l  i=l 


<l  +  Y  i^i'dih)  +  ct,h  <  ^2*,i(m)  +  ct, 

t—1  rn-\-h=l 

The  above  event  is  implied  by 

Xi^^j(h)  +  Ct^h  <  X2*j{h)  +  Ct^h+m 


E  A([/*,i)E[T,,,(n)]. 

j—l  i^U -worst 


□ 


C.  Proof  of  Lemma  2 

Although,  we  could  directly  compute  the  time  to  absorption 
of  the  Markov  chain,  we  give  a  simple  bound  E[T(/7,  [/)]  by 
considering  an  i.i.d  process  over  the  same  state  space.  We  term 
this  process  as  a  genie-aided  modification  of  random  allocation 
scheme,  since  this  can  be  realized  as  follows:  in  each  slot,  a 
genie  checks  if  any  collision  occurred,  in  which  case,  a  new 
random  variable  is  drawn  from  Unif([/)  by  all  users.  This  is 
in  contrast  to  the  original  random  allocation  scheme  where  a 
new  random  variable  is  drawn  only  when  the  particular  user 
experiences  a  collision.  Note  that  for  /7  =  2  users,  the  two 
scenarios  coincide. 

For  the  genie-aided  scheme,  the  expected  number  of  slots  to 
hit  orthogonality  is  just  the  mean  of  the  geometric  distribution 


Smce  Ct^m  >  Ct^h+m- 

The  above  event  implies  at  least  one  of  the  following  events 
and  hence,  we  can  use  the  union  bound. 

Xi*j{h)  <  111*  -  ct^h, 

111*  <  112-  +  2Ct^h+m' 

From  the  Chernoff-Hoeffding  bound, 

P[Xi*,,(t)  </ii*  -ct^h]  <t-\ 

nx2*,j  >/i2* <t-\ 

and  the  event  that  /ii*  <  /i2*  +  2ct^h-\-m  implies  that 

81ogt 


Since 


h^m< 

^=1  m=l  h^l 


y 


E[T>;^  =  C  =  2)]<^ 


1  +  ^. 


1 


p 


-p 


P 


<  OO, 


(34) 


where  p  is  the  probability  of  having  an  orthogonal  configura- 
tion in  a  slot.  This  is  in  fact  the  reciprocal  of  the  number  of 
compositions  of  U  [24,  Thm.  5.1],  given  by 

p={  u  )    ■  ^''^ 

The  above  expression  is  nothing  but  the  reciprocal  of  number 
of  ways  U  identical  balls  (users)  can  be  placed  in  U  different 
bins  (channels):  there  are  2/7  —  1  possible  positions  to  form 
U  partitions  of  the  balls. 


^l*,2* 

Case  2:  For  min(/7,  C)  >  2,  we  have 

U       C  n 

T'{n)  <UJ2J2Y1  I{gr''{a*;m)  <  5f™(6*;m)), 

a=l  6=a+l  m—l 

where  a*  and  6*  represent  channels  with  a*  and  6*  highest 
availabilities.  On  lines  of  the  result  for  U  =  C  =  2,  we  can 
show  that 


J2  E/[5f-(a*;m)  <  ^^''(6*;™)]  < 

171=1 

Hence,  (18)  holds. 

^ since  all  users  are  identical  for  this  analysis. 


8  log  n  TT^ 


□ 
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E.  Proof  of  Theorem  3 

Define  the  good  event  as  all  users  having  correct  top  U- 
order  of  the  ^-statistics,  given  by 

u 

S(n):=  P|{Top-[/  entries  of  gj(n)  are  same  as  in  /jl}. 
The  number  of  slots  under  the  bad  event  is 

n 

J2l[9^{m)]=T'{n), 

m=l 

by  definition  of  T'{n).  In  each  slot,  either  a  good  or  a  bad 
event  occurs.  Let  7  be  the  total  number  of  collisions  mU- 
best  channels  between  two  bad  events,  i.e.,  under  a  run  of 
good  events.  In  this  case,  all  the  users  have  the  correct  top 
U -ranks  of  channels  and  hence, 

E[7|S(n)]  <U¥.[T{U,U)]  <  00, 

where  E[T([/,  [/)]  is  given  by  (17).  Hence,  each  transition 
from  the  bad  to  the  good  state  results  in  at  most  /7E[T(/7,  U)] 
expected  number  of  collisions  in  the  /7-best  channels.  The 
expected  number  of  collisions  under  the  bad  event  is  at  most 
U¥.[T'{n)].  Hence,  (19)  holds.  □ 

F.  Proof  of  Lemma  4 

Under  C(n;  U),  a  [/-worst  channel  is  sensed  only  if  it  is 
mistaken  to  be  a    -best  channel.  Hence,  on  lines  of  Lemma  1 , 

E[Ti,^-(n)|C(n;  U)]  =  O(logn),    Vz  G  [/-worst,  j  =  1, . . . ,  /7. 

For  the  number  of  collisions  M(n)  in  the  [/-best  channels, 
there  can  be  at  most  U  Ylk=i  ^(^'  ^)  collisions  in  the  [/-best 
channels  where  a  :=  maxj=i  .  .  .  f/  Uj  is  the  maximum  estimate 
of  number  of  users.  Conditioned  on  C{n;U^),  a  <  [/,  and 
hence,  we  have  (24).  □ 

G.  Proof  of  Proposition  2 

Define  the  good  event  as  all  users  having  correct  top  U- 
order,  given  by 

u 

S(n):=  P|{Top-[/  entries  of  gj(n)  are  same  as  in  /jl}. 
The  number  of  slots  under  the  bad  event  is 

n 

J2l[9^{m)]=T'{n), 

by  definition  of  T'{n).  In  each  slot,  either  a  good  or  a  bad 
event  occurs.  Let  7  be  the  total  number  of  collisions  in  /c-best 
channels  between  two  bad  events,  i.e.,  under  a  run  of  good 
events.  In  this  case,  all  the  users  have  the  correct  top  U -ranks 
of  channels  and  hence, 

7|S(n)  <  UT{U,k), 

The  number  of  collisions  under  the  bad  event  is  at  most  T'(n). 
Hence,  (27)  holds.  □ 


H.  Proof  of  Lemma  6 
We  are  interested  in 

P[C^(n);  U]  =  P[u7=i[/f  ^(n)  >  [/], 

n  U 

=  n[j  \J{<^uAm)  >  an;U)}], 

m—1  j=l 

=  P[  max   ^uj{n)  >  ^(n;[/)], 

j=l,...,U 

where  ^  is  given  by  (20).  For  [/  =  1,  we  have  ¥[C^{n);  [/]  =  0 
since  no  collisions  occur. 
Using  (27)  in  Proposition  2, 

P[max^/e  .-(n)  >  ^{n;k)] 

<  P[/cT([/,  k){T\n)  +  1)  >  ^(n;  k)] 

<  nk{r{n)  +  1)  >  ^^^]  +  P[T([/,  k)  >  an] 

4(n;  A:) 

using  Markov  inequality.  By  choosing  an  =  cc;(l),  the  second 
term  in  (36),  viz.,  P[T([/, /c)  >  an]  ^  0  as  n  ^  00,  for 
k>U.  For  the  first  term,  from  (26)  in  Lemma  5,  E[T'(n)]  = 
O(logn).  Hence,  by  choosing  an  =  o(^*(n; /c)/ log  n),  the 
first  term  decays  to  zero.  Since  ^*(n;  U)  =  cc;(logn),  we  can 
choose  an  satisfying  both  the  conditions.  By  letting  k  =  U  in 
(36),  we  have  P[C^(n);  [/]  ^  0  as  n  ^  00,  and  (28)  holds.  □ 
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A^s^mc^— Managing  trust  in  a  distributed  Mobile  Ad  Hoc  Net- 
work (MANET)  is  challenging  when  collaboration  or  cooperation 
is  critical  to  achieving  mission  and  system  goals  such  as  reliability, 
availability,  scalability,  and  reconfigurability.  In  defining  and 
managing  trust  in  a  military  MANET,  we  must  consider  the 
interactions  between  the  composite  cognitive,  social,  information 
and  communication  networks,  and  take  into  account  the  severe 
resource  constraints  (e.g.,  computing  power,  energy,  bandwidth, 
time),  and  dynamics  (e.g.,  topology  changes,  node  mobility,  node 
failure,  propagation  channel  conditions).  We  seek  to  combine 
the  notions  of  "social  trust"  derived  from  social  networks  with 
"quality-of-service  (QoS)  trust"  derived  from  information  and 
communication  networks  to  obtain  a  composite  trust  metric.  We 
discuss  the  concepts  and  properties  of  trust  and  derive  some 
unique  characteristics  of  trust  in  MANETs,  drawing  upon  social 
notions  of  trust.  We  provide  a  survey  of  trust  management 
schemes  developed  for  MANETs  and  discuss  generally  accepted 
classifications,  potential  attacks,  performance  metrics,  and  trust 
metrics  in  MANETs.  Finally,  we  discuss  future  research  areas 
on  trust  management  in  MANETs  based  on  the  concept  of  social 
and  cognitive  networks. 

Index  Terms— Trust  management,  mobile  ad  hoc  networks, 
social  networks,  cognitive  networks,  trust,  trust  metrics. 

I.  Introduction 

IN  AN  INCREASINGLY  networked  world,  increased  con- 
nectivity could  lead  to  improved  information  sharing,  fa- 
cilitate collaboration,  and  enable  distributed  decision  making, 
which  is  the  underlying  concept  in  Network  Centric  Opera- 
tions. In  mobile  ad  hoc  networks  (MANETs),  the  distributed 
decision  making  should  take  into  account  trust  in  the  elements: 
the  sources  of  information,  the  processors  of  information,  the 
elements  of  the  communications  network  across  which  the 
information  is  transmitted,  etc.  This  trust  must  often  be  derived 
under  time-critical  conditions,  and  in  a  distributed  way. 

A.  Design  Challenges  in  MANET  Protocols 

A  mobile  ad  hoc  network  [1]  consists  of  wireless  mobile 
nodes  forming  a  temporary  network  without  the  help  of  cen- 
tralized infrastructure,  and  where  nodes  communicate  through 
multi-hops. 

Security  protocol  designers  for  MANETs  face  technical 
challenges  due  to  severe  resource  constraints  in  bandwidth. 
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memory  size,  battery  life,  computational  power,  and  unique 
wireless  characteristics  such  as  openness  to  eavesdropping, 
lack  of  specific  ingress  and  exit  points,  high  security  threats, 
vulnerability,  unreliable  communication,  and  rapid  changes 
in  topologies  or  memberships  because  of  user  mobility  or 
node  failure  [1][2][3].  In  addition,  compared  with  designing 
security  protocols  for  civilian  MANETs,  designing  security 
protocols  for  military  MANETs  requires  additional  caution, 
since  battlefield  communication  networks  must  cope  with 
hostile  environments,  node  heterogeneity,  often  stringent  per- 
formance constraints,  node  subversion,  high  tempo  operations 
leading  to  rapid  changes  in  network  topology  and  service 
requirements,  and  dynamically  formed  communities  of  in- 
terest wherein  participants  may  not  have  predefined  trust 
relationships  [4].  To  cope  with  these  dynamics,  networks 
must  be  able  to  reconfigure  seamlessly,  via  low-complexity 
distributed  network  management  schemes  [3].  Security  in  a 
tactical  network  includes  notions  of  communication  security 
which  can  be  easily  quantified  as  opposed  to  the  perception 
of  security  which  is  hard  to  quantify. 

B.  Motivation  for  Trust  Management  in  MANETs 

The  concept  of  "Trust"  originally  derives  from  social  sci- 
ences and  is  defined  as  the  degree  of  subjective  belief  about 
the  behaviors  of  a  particular  entity  [5].  Blaze  et  al.  [6]  first 
introduced  the  term  "Trust  Management"  and  identified  it  as 
a  separate  component  of  security  services  in  networks  and 
clarified  that  "Trust  management  provides  a  unified  approach 
for  specifying  and  interpreting  security  policies,  credentials, 
and  relationships." 

Trust  management  in  MANETs  is  needed  when  partic- 
ipating nodes,  without  any  previous  interactions,  desire  to 
establish  a  network  with  an  acceptable  level  of  trust  rela- 
tionships among  themselves.  Examples  would  be  in  building 
initial  trust  bootstrapping  [7],  coalition  operations  without 
predefined  trust,  and  authentication  of  certificates  generated 
by  another  party  when  links  are  down  or  ensuring  safety 
before  entering  a  new  zone  [8].  In  addition,  trust  management 
has  diverse  applicability  in  many  decision  making  situations 
including  intrusion  detection,  authentication,  access  control, 
key  management,  isolating  misbehaving  nodes  for  effective 
routing,  and  other  purposes. 

Trust  management,  including  trust  establishment,  trust  up- 
date, and  trust  revocation,  in  MANETs  is  also  much  more 
challenging  than  in  traditional  centralized  environments.  For 
example,  collecting  trust  information  or  evidence  to  evalu- 
ate trustworthiness  is  difficult  due  to  changes  in  topology 
induced  by  node  mobility  or  node  failure.  Further,  resource 
constraints  often  confine  the  trust  evaluation  process  only 
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Fig.  1.    The  multidisciplinary  concept  of  trust  and  its  application  in  communications  and  networking. 


to  local  information.  The  dynamic  nature  and  characteristics 
of  MANETs  result  in  uncertainty  and  incompleteness  of  the 
trust  evidence,  which  is  continuously  changing  over  time  [8] 
[9].  Despite  a  couple  of  surveys  of  trust  management  [10] 
[11]  [12],  a  comprehensive  survey  of  trust  management  in 
MANETs  does  not  exist  and  is  the  main  aim  of  this  paper. 
A  short  version  of  this  paper  was  presented  at  ICCRTS  2009 
[13].  The  contributions  of  this  paper  are:  (1)  to  give  a  clear 
definition  of  trust  in  the  communication  and  networking  field, 
drawing  upon  definitions  from  different  disciplines;  (2)  to 
extensively  survey  the  existing  trust  management  schemes 
developed  for  MANETs  and  investigate  their  general  trends; 
and  (3)  to  discuss  future  research  areas  based  on  the  concept 
of  social  and  cognitive  networks. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  2, 
we  discuss  the  concept  of  trust  in  diverse  disciplines,  give  a 
clear  distinction  between  trust  and  trustworthiness,  and  discuss 
the  relationship  between  trust  and  risk.  We  also  introduce 
the  main  properties  of  trust  in  MANETs.  Section  3  surveys 
generally  accepted  classifications  of  trust  management,  at- 
tacks considered  in  existing  trust  management  schemes  for 
MANETs,  and  metrics  used  to  measure  the  performance 
of  existing  MANET  trust  management  schemes.  Section  4 
surveys  trust  management  schemes  that  have  been  developed 
for  specific  purposes,  including  secure  routing,  authentication, 
intrusion  detection,  access  control,  key  management,  and  trust 
evidence  distribution  and  evaluation.  In  Section  5,  we  discuss 
design  concepts  that  designers  of  MANET  trust  management 
systems  should  keep  in  mind  and  suggest  trust  metrics  based 
on  the  concepts  of  social  trust  and  quality-of- service  (QoS) 
trust.  Section  6  concludes  this  paper. 

II.  Concepts  and  Properties  of  Trust 

In  this  section,  we  review  how  trust  is  defined  in  different 
disciplines  and  how  these  trust  concepts  can  be  applied  in 
modeling  trust  in  MANETs.  Further,  we  examine  the  relation- 
ship between  trust  and  risk,  and  how  trust  should  be  defined 
in  order  to  realistically  reflect  the  unique  characteristics  of 
MANETs. 

A.  Multidisciplinary  Concept  of  Trust 

According  to  Merriam  Webster's  Dictionary  [14],  trust  is 
defined  as  "assured  reliance  on  the  character,  ability,  strength. 


or  truth  of  someone  or  something."  Despite  the  subjective 
nature  of  trust,  the  concept  of  trust  has  been  very  attractive 
to  network  security  protocol  designers  because  of  its  diverse 
applicability  as  a  decision  making  mechanism.  We  examine 
the  literature  to  study  how  trust  is  defined  in  various  disci- 
plines including  sociology,  economics,  philosophy,  psychol- 
ogy, organizational  management,  and  autonomic  computing 
in  industrial  and  system  engineering.  Finally,  we  also  examine 
how  trust  can  be  defined  in  communications  and  networking 
with  the  help  of  definitions  in  other  fields. 

Trust  in  sociology.  Gambetta's  notion  of  trust  [15]  is  popu- 
larly called  sociological  trust  and  is  defined  as  an  assessor's  a 
priori  subjective  probability  that  a  person  (or  agent,  or  group) 
will  perform  specific  actions  that  affect  the  assessor.  That  is, 
Gambetta  [15]  describes  the  nature  of  trust  as  subjectivity, 
an  indicator  for  future  actions,  and  dynamicity  based  on 
continuous  interactions  between  two  entities.  Luhmann  [16] 
also  emphasized  the  importance  of  trust  in  society  as  a 
mechanism  for  building  cooperation  among  people  to  extend 
human  interactions  for  future  collaboration.  Adams  et  al.  [17] 
rephrased  Gambetta's  trust  concept  in  applying  the  sociolog- 
ical concept  of  trust  in  computer  science;  they  represented 
trust  as  a  continuous  variable,  quantifying  trust  in  the  light 
of  context  or  acceptance  of  risk.  They  further  stressed  that 
risking  betrayal  is  an  important  aspect  in  building  trust.  To 
be  useful,  network  trust  models  must  capture  this  subjective 
aspect  of  social  trust. 

Trust  in  economics:  Economists  distinguish  between  the 
personal,  informal  trust  that  comes  from  being  friendly  with 
your  neighbors  and  the  impersonal,  institutionalized  trust  that 
lets  you  give  your  credit  card  number  out  over  the  Internet 
[18].  Both  notions  of  trust  are  important  in  military  MANETs. 
In  economics,  trust  is  represented  as  an  expectation  that 
applies  to  situations  in  which  trustors  take  risky  actions  under 
uncertainty  or  information  incompleteness  [19].  However,  as 
illustrated  in  the  Prisoner's  Dilemma  (PD)  game  [20],  trust  in 
economics  is  based  on  the  assumption  that  humans  are  rational 
and  strict  utility  maximizers  of  their  own  interest  or  incentives. 
In  this  sense,  when  we  apply  a  human  trust  model  to  a  network 
trust  model,  the  assumption  of  selfish  nodes  seems  reasonable. 
But  altruistic  behaviors  can  emerge  from  mechanisms  that  may 
be  initially  purely  selfish  [21],  and  thus  making  an  argument 
for  redemption  mechanisms.  Economic  models  are  used  in 
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conjunction  with  trust-based  encryption  primitives  in  [22]  to 
develop  a  trust  management  paradigm  for  securing  information 
flows  across  organizations. 

Trust  in  philosophy.  According  to  the  Stanford  Encyclope- 
dia of  Philosophy  [23],  trust  is  important  but  dangerous.  Since 
trust  aUows  us  to  form  relationships  with  others  and  to  rely  on 
others  for  love,  advice,  help,  etc.,  trust  is  regarded  as  a  very 
important  factor  in  our  life  that  compels  others  to  give  us  such 
things  with  no  outside  force  such  as  the  law.  On  the  other 
hand,  since  trust  requires  taking  a  risk  that  the  trustee  may 
not  behave  as  the  trustor  expects,  trust  is  dangerous  implying 
the  possible  betrayal  of  trust.  In  his  comments  on  Lagerspetz's 
book  titled  Trust:  The  Tacit  Demand,  Lahno  [24]  describes  the 
author's  view  on  trust  as  a  moral  relationship  in  human  society. 
Langerspetz  believes  that  investigations  of  trust  reveal  that 
"human  individuals,  their  beliefs,  desires  and  actions  are  only 
intelligible  against  the  background  of  existing  social  practices 
and  social  ties"  [24].  This  implies  that  depending  on  the  nature 
of  personal  relationships  between  a  trustor  and  a  trustee  (i.e., 
moral  relationship  between  them),  trustful  actions  or  betrayal 
can  occur. 

Trust  in  psychology:  According  to  the  Wikipedia  definition 
of  trust  in  psychology  [25],  trust  starts  from  the  birth  of  the 
child.  As  the  child  grows  older,  trust  also  grows  stronger. 
However,  the  root  of  trust  derives  from  the  relationship  be- 
tween mother  (or  caregiver)  of  the  child  since  the  strength  of 
the  family  relies  on  trust,  if  the  child  is  raised  in  a  family 
which  is  very  accepting  and  loving,  the  child  also  returns 
those  feelings  to  others  by  trusting  them.  But  if  trust  is  lost, 
it  is  hard  to  regain  it.  In  this  sense,  trust  in  psychology 
emphasizes  the  cognitive  process  that  human  beings  learn 
trust  from  their  experiences.  Deutsch  [26]  defines  trust  as  the 
confidence  that  one  will  find  what  is  desired  from  another 
rather  than  what  is  feared.  An  individual  may  be  said  to 
have  trust  in  the  occurrence  of  an  event  if  he  expects  its 
occurrence  and  his  expectation  leads  to  the  behavior  which 
he  perceives  to  have  greater  negative  consequences  if  the 
expectation  is  not  confirmed  than  positive  consequences  if  it  is 
confirmed.  In  addition,  Hardin  [27]  and  Rotter  [28]  observed 
in  their  experiments  that  past  experience  may  strikingly  affect 
later  capacity  for  trust.  For  example,  bad  experience  with 
people  will  lower  the  trust  level,  leading  to  fewer  trusted 
relationships  with  people,  and  thus  fewer  opportunities  for 
mutual  gain.  Further,  they  recognized  that  the  gains  obtained 
by  having  high  trust  relationships  exceed  the  loss  by  having 
low  trust  relationships.  For  instance,  high  trustors  are  less 
likely  to  lie  or  cheat  or  steal.  Also  they  are  less  likely  to  be 
unhappy,  conflicted,  or  unstable,  and  sought  by  more  friends. 
Even  though  high  trustors  are  deceived  more  often  in  novel 
situations,  low  trustors  are  also  fooled  equally  by  distrusting 
trustworthy  people,  thereby  losing  the  advantages  that  high 
trustors  may  have  [28]. 

Trust  in  organizational  management:  In  this  field,  the 
concept  of  trust  is  also  defined  as  the  extent  to  which  one  party 
is  willing  to  count  on  someone  or  something  with  a  feeling 
of  relative  security  in  spite  of  possible  negative  consequences, 
emphasizing  the  possibility  of  facing  risk  [29].  Schoorman 
et  al.  [30]  defined  trust  as  the  willingness  to  take  a  risk  or 
willingness  to  be  vulnerable  in  the  relationship  in  terms  of 


ability,  integrity,  and  benevolence.  They  also  explained  that 
trust  is  not  necessarily  mutual  and  is  not  reciprocal.  Trust 
concepts  in  organizational  management  can  give  us  insights 
on  how  to  measure  trust  by  investigating  methods  to  measure 
ability,  integrity,  and  benevolence  of  each  networked  node, 
as  well  as  on  assessing  risk.  They  can  also  give  us  insights 
on  defining  group  trust  (i.e.,  between  a  person  and  a  group  or 
between  groups)  which  is  important  for  dynamic  communities 
of  interest. 

Trust  in  autonomic  computing:  As  technology  becomes 
more  complex,  fully  understanding  automation  becomes  in- 
feasible,  if  not  impossible,  and  trust  in  automation  becomes 
critical,  particularly  when  unexpected  situations  arise  and 
system  responses  cannot  be  predicted.  Researchers  studying 
autonomic  computing  in  industrial  systems  engineering  have 
sought  to  develop  models  of  trust  to  understand  how  trust  in 
automation  develops  and  how  it  may  be  misplaced.  Lee  and 
See  [31]  define  trust  as  the  attitude  that  an  agent  will  help 
accomplish  an  individual's  goals  in  a  situation  with  uncertainty 
and  vulnerability.  In  this  sense,  an  agent  can  be  automation  or 
another  person  that  actively  interacts  with  the  environment  on 
behalf  of  the  person.  Parasuraman  [32]  links  the  level  of  trust 
with  automation  reliability  stating  that  "Trust  often  determines 
automation  usage.  Operators  may  not  use  a  reliable  automated 
system  if  they  believe  it  to  be  untrustworthy."  The  notion  of 
automation  reliability  as  a  trust  metric  is  one  that  is  applicable 
in  MANETs,  where  the  user's  trust  in  reliability  on  technology 
is  an  important  aspect. 

Trust  in  communications  and  networking:  The  concept  of 
trust  also  has  been  attractive  to  communication  and  network 
protocol  designers  where  trust  relationships  among  participat- 
ing nodes  are  critical  in  building  cooperative  and  collabora- 
tive environments  to  optimize  system  objectives  in  terms  of 
scalability,  reconfigurability,  and  reliability  (i.e.,  survivability), 
dependability,  or  security.  According  to  Eschenauer  et  al.  [9], 
trust  is  defined  as  "a  set  of  relations  among  entities  that 
participate  in  a  protocol.  These  relations  are  based  on  the 
evidence  generated  by  the  previous  interactions  of  entities 
within  a  protocol.  In  general,  if  the  interactions  have  been 
faithful  to  the  protocol,  then  trust  will  accumulate  between 
these  entities."  Capra  [34]  proposes  to  use  a  human  trust 
model  based  on  human  interactions  in  a  trust  model  for  fully 
distributed  network  environments  such  as  MANETs.  Capra 
defines  trust  as  the  degree  of  a  belief  about  the  behavior  of 
other  entities  (or  agents).  Li  and  Singhal  [35]  define  trust 
as  the  belief  that  an  entity  is  capable  of  performing  reliably, 
dependably,  and  securely  in  a  particular  case;  hence,  different 
levels  of  trust  exist  in  different  contexts.  For  example,  Alice 
may  trust  her  physician  to  give  her  advice  on  her  health 
concerns  but  may  not  trust  her  physician's  advice  on  fixing 
her  car.  Aivaloglou  et  al.  [36]  describe  trust  as  the  quantified 
belief  of  a  trustor  regarding  competence,  honesty,  security,  and 
dependability  of  a  trustee  in  a  specific  context. 

Recently,  researchers  have  recognized  the  importance  of 
social  networks  in  building  trust  relationships  among  entities. 
Golbeck  [37]  [38]  [39]  introduces  the  concept  of  social  trust 
by  suggesting  the  use  of  social  networks  as  a  bridge  to 
build  trust  relationships  among  entities.  Golbeck  proposes  the 
application  of  a  trust  concept  derived  from  a  sociological 
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Fig.  2.    Trust  level  [42]. 

viewpoint  to  computer  science,  and  describes  trust  as  a  well- 
defined  descriptor  of  security  and  encryption  as  a  metric  to 
reflect  security  goals.  Wong  and  Sycara  [40]  introduce  security 
mechanisms  to  establish  trust  in  multi-agent  systems.  They  are 
concerned  with  both  authenticating  agents  as  well  as  ensuring 
that  agents  do  not  misbehave.  Trustworthiness  emerges  from 
the  security  features  in  their  system. 

From  the  definitions  of  trust  derived  from  various  fields 
as  reviewed  above,  we  can  construct  a  trust  metric  having 
the  following  characteristics:  (1)  trust  should  be  established 
based  on  potential  risks;  (2)  trust  should  be  context-dependent; 
(3)  trust  should  be  based  on  each  party's  own  interest  (e.g., 
selfishness);  (4)  trust  is  learned  (i.e.,  a  cognitive  process);  and 
(5)  trust  may  represent  system  reliability. 

B.  Trust,  Trustworthiness,  and  Risk 

In  the  literature,  the  terms  trust  and  trustworthiness  seem  to 
be  used  interchangeably  without  clear  distinction.  Josang  et  al. 
[41]  clarified  the  difference  between  trust  and  trustworthiness 
based  on  definitions  provided  by  Gambetta  [15].  Level  of 
trust  is  defined  as  the  belief  probability  varying  from  0 
(complete  distrust)  to  1  (complete  trust)  [41].  In  this  sense, 
trustworthiness  is  a  measure  of  the  actual  probability  that  the 
trustees  will  behave  as  expected.  Solhaug  et  al.  [42]  define 
trustworthiness  as  the  objective  probability  that  the  trustee 
performs  a  particular  action  on  which  the  interests  of  the 
trustor  depend. 

Figure  2  [42]  explains  how  trust  (i.e.,  subjective  probability 
of  trust  level)  and  trustworthiness  (i.e.,  objective  probability  of 
trust  level)  can  differ  and  how  the  difference  affects  the  level 
of  risk  the  trustor  needs  to  take.  The  diagonal  dashed  line  is 
assumed  to  be  marks  of  well-founded  trust  in  which  trust  is 
equivalent  to  trustworthiness. 

Depending  on  the  extent  to  which  the  trustor  is  ignorant 
about  the  difference  between  the  believed  (i.e.,  trust)  and  the 
actual  (i.e.,  trustworthiness)  probability,  there  is  a  miscalcu- 
lation of  the  involved  risk.  That  is,  the  subjective  aspect  of 
trust  results  in  incorrect  risk  estimation  and  improper  risk 
management  accordingly.  Figure  2  shows  the  cases  in  which 
the  probability  is  miscalculated.  In  the  area  below  the  diagonal 
line,  there  is  misplaced  trust  to  various  degrees  that  the 
perceived  trust  is  higher  than  the  actual  trustworthiness.  Even 


though  risk  is  an  intrinsic  characteristic  of  trust  even  in  well- 
founded  trust,  misplaced  trust  increases  risk  and  thus  enhances 
the  chance  of  deceit  as  well,  as  shown  in  the  example  marked 
with  b  in  Figure  2.  On  the  other  hand,  when  the  perceived 
trust  is  lower  than  the  actual  trustworthiness  as  shown  in  the 
example  marked  with  a,  the  trustee  is  distrusted  more  than 
warranted.  In  this  case,  the  trustor  may  lose  potentially  good 
opportunities  to  cooperate  with  partners  with  high  trustwor- 
thiness. 

From  the  above  discussions,  we  can  conclude  that  careful 
risk  estimation  is  closely  linked  with  building  accurate  trust 
relations  among  participating  entities  in  networks.  However, 
Josang  et  al.  [41]  argue  that  objective  trust  may  not  be  appli- 
cable to  decision  making  in  real  situations.  They  define  two 
interesting  types  of  trust:  1)  a  context  independent  reliability 
trust  which  measures  the  perceived  reliability  by  another  party 
regardless  of  the  situations  which  the  trustor  might  face  by 
recognizing  possible  risk;  2)  decision  trust  as  "the  extent 
to  which  a  given  party  is  willing  to  depend  on  something 
or  somebody  in  a  given  situation  with  a  feeling  of  relative 
security  even  though  negative  consequences  are  possible." 
Decision  trust  deals  with  components  such  as  utility  and  risk 
attitude.  As  an  example,  one  may  not  trust  an  old  rope  for 
climbing  down  from  the  3rd  floor  of  a  building  during  a  fire 
exercise  (i.e.,  reliability  trust)  while  trusting  the  rope  in  a  real 
fire  (i.e.,  decision  trust). 

The  relationship  between  trust  and  risk  has  been  investi- 
gated in  [41]  [42].  Figure  3  shows  an  example  of  three  different 
risk  values:  low,  medium,  and  high.  The  value  of  risk  is  low 
for  all  trust  values  when  the  stake  is  close  to  zero.  Similarly, 
if  the  stake  is  too  high,  risk  is  regarded  as  high  regardless  of 
the  estimated  trust  value.  Risk  is  generally  low  when  the  trust 
value  is  high.  However,  the  risk  value  should  be  determined 
based  on  the  value  at  stake  (e.g.,  risk  probability)  since  as 
shown  in  Figure  3,  high  risk  exists  even  for  the  case  of  trust 
value  =  1.  Also  important  are  the  aspects  (or  probability)  of 
opportunity  and  prospect  (or  the  positive  consequence  of  an 
opportunity)  [41]  [42].  To  buy  rubber  is  to  do  risky  business, 
but  it  also  gives  the  opportunity  of  selling  refined  products 
with  net  profit.  The  purchaser  of  rubber  should  estimate  her/his 
acceptable  risk  level  in  terms  of  the  calculated  prospects. 
Josang  et  al.  [41]  and  Solhaug  et  al.  [42]  conclude  that  trust 
is  generally  neither  proportional  nor  inversely  proportional  to 
risk. 

Some  researchers  have  commented  that  trust  and  uncertainty 
are  intimately  linked  -  trust  is  a  mechanism  to  cope  with 
uncertainty.  The  level  of  uncertainty  in  the  information  used 
as  trust  evidence  will  also  considerably  influence  the  accuracy 
of  trust  evaluation  [43]. 

C.  Trust  Properties  in  MANETs 

Due  to  the  unique  characteristics  of  MANET  environments 
and  the  inherent  unreliability  of  the  wireless  channel,  the 
concept  of  trust  in  MANETs  should  be  carefully  defined. 
The  main  properties  of  trust  in  MANET  environments  can 
be  summarized  as  follows  (see  Figure  4): 

First,  trust  is  dynamic,  not  static.  Trust  establishment  in 
MANETs  should  be  based  on  temporally  and  spatially  local 
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-  Fig.  5.    Trust  properties  in  trust  management  schemes  in  MANETs. 
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Fig.  4.    Trust  properties  in  MANETs. 


information:  due  to  node  mobility  or  failure,  information  is 
typically  incomplete  and  can  change  rapidly  [8]  [32].  Adams 
et  al.  [44]  point  out  that  in  order  to  capture  the  dynamicity  of 
trust,  trust  should  be  expressed  as  a  continuous  variable,  rather 
than  as  a  binary  or  even  discrete- valued  entity.  A  continuous 
valued  variable  can  represent  uncertainty  better  than  a  binary 
variable. 

Second,  trust  is  subjective  [45].  In  MANET  environments, 
a  trustor  node  may  determine  a  different  level  of  trust  against 
the  same  trustee  node  due  to  different  experiences  with  the 
node  derived  from  a  dynamically  changing  network  topology. 

Third,  trust  is  not  necessarily  transitive  [46].  For  example, 
if  A  trusts  B,  and  B  trusts  C,  it  does  not  guarantee  A  trusts  C. 
In  order  to  use  the  transitivity  of  trust  between  two  entities  to 
a  third  party,  a  trustor  should  maintain  two  types  of  trust:  trust 
in  a  trustee  and  trust  in  the  trustee's  recommendation  of  the 
third  party.  For  example,  Alice  may  trust  Bob  about  movies, 
but  not  trust  him  at  all  to  recommend  other  people  whose 
opinion  about  movies  is  worth  considering  or  not  trust  other 
people  that  Bob  recommended  as  much  as  she  trusts  Bob. 

Fourth,  trust  is  asymmetric,  not  necessarily  reciprocal  [44] . 
In  heterogeneous  MANETs,  nodes  with  higher  capability  (e.g., 
more  energy  or  computational  power)  may  not  trust  nodes 


with  lower  capability  at  the  same  level  that  nodes  with  lower 
capability  trust  nodes  with  higher  capability.  As  a  typical 
example  in  organizational  management,  a  supervisor  tends  to 
trust  an  employee  less  than  the  employee  trusts  the  supervisor. 

Fifth,  trust  is  context-dependent  [33].  For  example,  A  may 
trust  B  as  a  wine  expert  but  not  as  a  car  fixer.  Similarly  in 
MANETs  depending  on  the  given  task,  different  types  of  trust 
(e.g.,  trust  in  computational  power  or  trust  in  unselfishness, 
trust  in  forwarding  versus  trust  in  reporting)  are  required. 

Figure  5  shows  how  several  trust  properties  are  considered 
in  the  literature.  Dynamicity  and  weighted  transitivity  are 
most  often  considered.  However,  we  notice  that  some  existing 
work  does  not  even  consider  trust  properly;  some  represent 
trust  as  a  discrete  variable,  while  others  assume  that  trust 
is  symmetric  or  completely  transitive.  As  such  they  do  not 
capture  characteristics  of  trust  in  a  MANET.  Further,  we  could 
not  find  any  prior  work  that  comprehensively  considers  all  five 
properties  of  trust  shown  in  Figure  5.  Note  that  Figure  5  is 
based  on  36  papers  and  each  work  may  consider  multiple  trust 
properties. 

In  order  to  properly  take  into  account  these  unique  charac- 
teristics of  trust  in  MANETs  as  described  above,  any  trust- 
based  framework  for  MANETs  should  consider  the  following 
as  well: 

First,  a  decision  procedure  to  determine  the  trust  of  an  entity 
should  be  fully  distributed  based  on  cooperative  evaluation 
with  uncertain  and  incomplete  evidence,  since  one  cannot 
rely  on  a  trusted  third  party  such  as  a  trusted  centralized 
certificate  authority  to  take  care  of  trust  management  as  in 
wired  networks  [8]  [9]  [34]. 

Second,  trust  should  be  determined  in  a  highly  customizable 
way  (e.g.,  flexible  to  membership  changes  and  to  deployment 
scenarios)  without  causing  disruption  to  the  device  computa- 
tion and  communication  resources  while  capturing  the  various 
and  complicated  natural  components  of  an  individual's  trust 
into  a  network  model  [34]  [47]. 

Third,  a  trust  decision  framework  should  not  assume  that 
all  nodes  are  cooperative  [34].  In  resource-restricted  environ- 
ments, selfishness  is  likely  to  be  prevalent  over  cooperation,  for 
example,  in  order  to  save  battery  life  or  computational  power. 
Thomas  et  al.  [48]  discuss  the  tradeoff  between  selfishness 
and  altruism  of  participating  nodes  in  MANETs  in  terms  of 
prolonging  system  lifetime  (e.g.,  with  system  lifetime  defined 
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as  the  time  to  a  node's  death  due  to  energy  exhaustion)  versus 
reducing  selfish  behaviors  to  enhance  system  throughput. 

Finally,  trust  should  be  established  in  a  self-organized 
reconfigurable  way  in  order  not  to  be  disrupted  by  the  dy- 
namics of  MANET  environments  [8]  [48].  In  addition  to  the 
characteristics  mentioned  above,  trust-based  frameworks  for 
MANETs  should  consider  the  tradeoff  issues  between  security 
and  performance  including  reliability,  fault  tolerance,  scalabil- 
ity, and  energy  consumption  where  resources  are  restricted  but 
security  vulnerability  is  relatively  high. 

III.  Classifications,  Potential  Attacks,  and 
Metrics  for  MANET  Trust  Management 

This  section  discusses  classifications,  attacks  and  perfor- 
mance metrics  for  MANET  trust  management.  Before  review- 
ing the  literature,  we  would  like  to  clarify  some  terminologies 
that  have  been  used  interchangeably  but  sometimes  confus- 
ingly in  the  context  of  trust  management. 

In  general,  the  term  trust  management  is  interchangeably 
used  with  the  term  reputation  management  [35].  However, 
there  is  a  slight  difference  between  trust  and  reputation. 
According  to  Liu  et  al.  [49],  trust  is  active  while  reputation  is 
passive.  That  is,  trust  is  a  node's  belief  in  the  trust  qualities 
of  a  peer,  thus  being  extended  from  a  node  to  its  peer. 
Reputation  is  the  perception  that  peers  form  about  a  node. 
Further,  Ruhomaa  et  al.  [10]  distinguish  trust  from  reputation, 
noting  that  trust  puts  an  emphasis  on  risk  and  incentives 
while  reputation  focuses  on  a  perception  that  a  party  creates 
through  past  actions  about  its  intentions  in  the  context  of  the 
norms  effective  within  a  community.  Also,  recommendation 
is  frequently  used  as  a  way  to  measure  trust  or  reputation. 
Recommendation  is  simply  an  attempt  at  communicating  a 
party's  reputation  from  one  community  context  to  another 
[45][10]. 

A  working  definition  of  trust  for  Internet  applications,  and 
a  survey  of  trust  management  schemes  for  such  applications 
may  be  found  in  [12]. 

In  most  of  the  literature,  reputation  management  is  re- 
garded as  part  of  trust  management.  Further,  the  terms  trust 
management  and  trust  establishment  are  also  interchangeably 
used.  To  clarify  these  two  terms,  according  to  Aivaloglou  et 
al.  [36],  trust  establishment  is  a  process  to  deal  with  the 
representation,  evaluation,  maintenance,  and  distribution  of 
trust  among  nodes. 

Trust  management  deals  with  problems  such  as  the  formu- 
lation of  evaluation  rules  and  policies,  representation  of  trust 
evidence,  and  evaluation  and  management  of  trust  relation- 
ships among  nodes.  As  Figure  6  explains,  trust  establishment 
is  one  of  several  trust  management  tasks. 

A.  Classifications 

According  to  Solhaug  et  al.  [42],  trust  management  is  a 
special  case  of  risk  management  with  a  particular  emphasis 
on  authentication  of  entities  under  uncertainty  and  decision 
making  on  cooperation  with  unknown  entities.  However,  the 
application  of  trust  management  has  been  extended  from 
authentication  to  various  aspects  of  communications  and 
networking,  including  secure  routing  for  isolating  malicious 


or  selfish  nodes,  intrusion  detection,  key  management,  ac- 
cess control,  and  other  decision  making  mechanisms.  Trust 
management  includes  trust  establishment  (i.e.,  collection  of 
appropriate  trust  evidence,  trust  generation,  trust  distribution, 
trust  discovery,  and  evaluation  of  trust  evidence),  trust  update, 
and  trust  revocation  [50]  [42].  This  section  surveys  popularly 
used  classifications  of  trust  management  (or  establishment). 

Li  et  al.  [51]  and  Li  et  al.  [52]  classify  trust  management 
as  reputation-based  framework  and  trust  establishment  frame- 
work. A  reputation-based  framework  uses  direct  observations 
and  second-hand  information  distributed  among  nodes  in  a 
network  to  evaluate  a  node.  A  trust  establishment  framework 
evaluates  neighboring  nodes  based  on  direct  observations 
while  trust  relations  between  two  nodes  without  prior  direct 
interactions  are  built  through  a  combination  of  opinions  from 
intermediate  nodes. 

Yonfang  [53]  suggests  two  different  approaches  to  evaluate 
trust:  policy-based  trust  management  and  reputation-based 
trust  management.  Policy-based  trust  management  is  based 
on  strong  and  objective  security  schemes  such  as  logical  rules 
and  verifiable  properties  encoded  in  signed  credentials  for 
access  control  of  users  to  resources.  In  addition,  the  access 
decision  is  usually  on  the  basis  of  mechanisms  having  a  well- 
defined  trust  management  language  that  has  strong  verification 
and  proof  support.  Such  a  policy-based  trust  management 
approach  usually  makes  a  binary  decision  according  to  which 
the  requester  is  trusted  or  not,  and  accordingly  the  access 
request  is  allowed  or  not.  Due  to  the  binary  nature  of  trust 
evaluation,  policy-based  trust  management  has  less  flexibility. 
Furthermore,  the  availability  of  (or  access  to)  trusted  certificate 
authorities  (CA)  cannot  always  be  guaranteed,  particularly 
for  distributed  systems  such  as  MANETs.  On  the  other 
hand,  reputation-based  trust  management  utilizes  numerical 
and  computational  mechanisms  to  evaluate  trust.  Typically,  in 
such  a  system,  trust  is  calculated  by  collecting,  aggregating, 
and  disseminating  reputation  among  the  entities. 

According  to  Li  and  Singhal  [35],  trust  management 
can  be  classified  as  evidence-based  trust  management  and 
monitoring-based  trust  management.  Evidence-based  trust 
management  considers  anything  that  proves  trust  relationships 
among  nodes:  these  could  include  public  key,  address,  identity, 
or  any  evidence  that  any  node  can  generate  for  itself  or  other 
nodes  through  a  challenge  and  response  process.  Monitoring- 
based  trust  management  rates  the  trust  level  of  each  partici- 
pating node  based  on  direct  information  (e.g.,  observing  the 
benign  or  malicious  behaviors  of  neighboring  nodes,  such 
as  packet  dropping,  and  packet  flooding  leading  to  excessive 
resource  consumption  in  the  network,  or  denial  of  service  at- 
tacks) as  well  as  indirect  information  (e.g.,  reputation  ratings, 
such  as  recommendations  forwarded  from  other  nodes). 

Aivaloglou  et  al.  [36]  classify  two  types  of  trust  establish- 
ment frameworks  for  MANETs:  certificate-based  framework 
versus  behavior-based  framework.  In  the  former,  mechanisms 
are  defined  for  pre-deployment  knowledge  of  trust  relation- 
ships within  the  network,  using  certificates  which  are  dis- 
tributed, maintained  and  managed,  either  independently  or 
cooperatively  by  the  nodes.  Trust  decisions  can  be  made  based 
on  a  valid  certificate  that  proves  trustworthiness  of  the  target 
node  by  a  certificate  authority  or  by  other  nodes  that  the  issuer 
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trusts.  In  behavior-based  framework,  each  node  continuously 
monitors  behaviors  of  its  neighboring  nodes  in  order  to 
evaluate  trust.  The  behavior-based  framework  is  a  reactive 
approach,  operating  under  the  assumption  that  the  identities  of 
nodes  in  the  network  are  ensured  by  preloaded  authentication 
mechanisms.  For  example,  if  a  node  uses  network  resources 
in  an  unauthorized  way,  it  will  be  regarded  as  a  selfish  or 
malicious  node,  and  will  finally  be  isolated  from  other  nodes. 

Aivaloglou  et  al.  [36]  also  classify  trust  establishment 
schemes  in  terms  of  the  type  of  architectures  used:  hierarchi- 
cal framework  versus  distributed  framework.  In  the  former,  a 
hierarchy  exists  among  the  nodes  based  on  their  capabilities 
or  levels  of  trust.  In  this  framework,  centralized  certificate  au- 
thorities or  trusted  third  parties  are  usually  provided  for  on-line 
or  off-line  evidence.  Such  a  centralized  infrastructure  does  not 
exist  in  a  distributed  framework;  hence,  each  node  has  some, 
possibly  equal,  responsibility  for  acquiring,  maintaining,  and 
distributing  trust  evidence. 

Even  though  reputation  management  is  part  of  trust  man- 
agement, many  researchers  further  classify  reputation  man- 
agement schemes.  Adams  et  al.  [44]  propose  three  types  of 
reputation  systems:  positive  reputation,  negative  reputation, 
and  a  combination  of  the  two.  Positive  reputation  systems  only 
consider  observations  or  feedback  of  the  positive  behaviors  of 
a  node.  Negative  reputation  systems  only  record  complaints 
or  observations  of  the  negative  behaviors  of  a  node.  Peers  are 
assumed  to  be  trusted  and  so  feedback  on  behaviors  is  used 
to  negatively  reflect  a  node's  reputation.  To  complement  the 
drawbacks  of  these  mechanisms,  hybrid  reputation  systems 
have  been  proposed  [53].  For  more  information  on  reputation 
management,  the  readers  may  refer  to  [11]. 

B.  Potential  Attacks 

It  is  important  to  ensure  that  a  trust  management  system 
itself  should  not  be  easily  subverted,  attacked  or  compro- 
mised. In  this  section,  we  discuss  various  common  attacks 
and  describe  features  important  from  the  viewpoint  of  trust 
management.  A  survey  of  threat  models  and  specific  attacks 
on  ad  hoc  routing  protocols  are  described  by  Argyroudis  et 
al.  [54]  and  Djenouri  et  al.  [55]. 

Liu  et  al.  [49]  describe  the  characteristics  of  attacks  in 
MANETs  by  both  the  nature  of  attacks  and  the  type  of 
attackers.  One  classification  of  attacks  is  passive  attack  versus 
active  attack.  A  passive  attack  occurs  when  an  unauthorized 


party  gains  access  to  an  asset  but  does  not  modify  its  content. 
Passive  attacks  include  eavesdropping  and  traffic  analysis  (e.g., 
traffic  flow  analysis).  Eavesdropping  indicates  that  the  attacker 
monitors  transmissions  of  message  content.  Traffic  analysis 
refers  to  analyzing  patterns  of  data  transmission.  An  active 
attack  occurs  when  an  unauthorized  party  modifies  a  message, 
data  stream,  or  file.  Active  attacks  usually  take  the  form  of 
one  of  the  following  four  types  or  combinations:  masquerad- 
ing (i.e.,  impersonation  attack),  replay  (i.e.,  retransmitting 
messages),  message  modification,  and  denial-of-service  (DoS) 
(leading  to  excessive  resource  consumption  in  the  network). 

Yet  another  way  to  characterize  attacks  is  based  on  the 
legitimacy  of  an  entity  in  a  network:  insider  attack  versus 
outsider  attack  [56].  If  an  entity  is  authorized  to  access  system 
resources  but  employs  them  in  a  malicious  way  (e.g.,  in  a 
way  not  approved  by  the  authorizer),  it  is  classified  as  an 
insider  attack.  More  specifically,  inside  attackers  exploit  bugs 
in  privileged  system  programs  or  poorly  configured  privileges, 
and  then  they  may  install  backdoors  or  Trojan  horses  or 
other  such  mechanisms  to  facilitate  subsequent  acquisition  of 
privileged  access.  On  the  other  hand,  an  outsider  attack  is 
initiated  by  an  unauthorized  or  illegitimate  user.  They  usually 
acquire  access  to  an  authorized  account  and  try  to  perpetrate 
insider  attacks.  Both  attackers  may  spoof  network  protocols 
to  effectively  acquire  access  to  an  authorized  account. 

Many  trust  management  schemes  are  devised  to  detect 
misbehaving  nodes,  both  selfish  nodes  as  well  as  malicious 
nodes.  Specific  attack  examples  are  described  as  follows  (the 
list  is  representative,  not  exhaustive): 

•  Routing  loop  attacks'.  A  malicious  node  may  modify 
routing  packets  in  such  a  way  that  packets  traverse  a 
cycle  and  so  do  not  reach  the  intended  destination  [56]. 

•  Wormhole  attacks:  A  group  of  cooperating  malicious 
nodes  can  pretend  to  connect  two  distant  points  in  the 
network  with  a  low-latency  communication  link  called  a 
wormhole  link,  causing  disruptions  in  normal  traffic  load 
and  flow  [57]  [58]  [59]. 

•  Blackhole  attacks:  A  malicious  node,  the  so  called 
black  hole  node,  may  always  respond  positively  to  route 
requests  even  when  it  does  not  have  proper  routing  infor- 
mation. The  black  hole  can  drop  all  packets  forwarded 
to  it  [60]. 

•  Gray  hole  attacks:  A  malicious  node  may  selectively  drop 
packets  [61],  as  a  special  case  of  a  black  hole  attack.  For 
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example,  the  malicious  node  may  forward  routing  packets 
but  not  data  packets.  Similarly,  a  sinkhole  attacker  attracts 
nodes  to  route  through  it  and  then  selectively  routes 
packets  [49]. 

•  DoS  attacks:  A  malicious  node  may  block  the  normal 
use  or  management  of  communications  facilities,  for  ex- 
ample, by  causing  excessive  resource  consumption  [62]. 

•  False  information  or  false  recommendation:  A  mali- 
cious node  may  collude  and  provide  false  recommen- 
dations/information to  isolate  good  nodes  while  keeping 
malicious  nodes  connected.  In  the  stacking  attack,  a 
malicious  node  keeps  complaining  about  a  peer  node  and 
creates  the  peer's  negative  reputation  [44]  [63]. 

•  Incomplete  information:  A  malicious  node  may  not 
cooperate  in  providing  proper  or  complete  information. 
Usually  compromised  nodes  collude  to  perform  this  at- 
tack. However,  node  mobility  or  link  failure,  prevalent 
in  MANETs,  may  also  result  in  the  same  phenomenon 
[8][34]. 

•  Packet  modification/insertion:  A  malicious  node  may 
modify  packets  or  insert  malicious  packets  such  as  pack- 
ets with  incorrect  routing  information  [64] . 

•  Newcomer  attacks:  A  malicious  node  may  discard  its  bad 
reputation  or  distrust  by  registering  as  a  new  user.  The 
malicious  node  simply  leaves  the  system  and  joins  again 
for  trust  revocation,  flushing  out  its  previous  bad  history 
and  starting  to  accumulate  new  trust  [65]. 

•  Sybil  attacks:  A  malicious  node  can  use  multiple  net- 
work identities  which  can  affect  topology  maintenance 
and  fault  tolerant  schemes  such  as  multi-path  routing 
[61][49][46]. 

•  Blackmailing:  A  malicious  node  can  blackmail  another 
node  by  disseminating  false  information  that  another 
node  is  malicious  or  misbehaving.  This  can  generate 
significant  amount  of  traffic  and  ultimately  disrupt  the 
functionality  of  the  entire  network  [49].  This  attack  can 
be  seen  as  false  accusation  plus  DoS  attacks  in  the 
sense  that  false  information  is  disseminated  leading  to 
a  significant  amount  of  resource  consumption. 

•  Replay  attacks:  A  malicious  node  may  replay  earlier 
transmitted  packets.  If  the  packets  include  data,  this 
should  not  cause  trouble,  and  the  receiving  node  just 
discards  erroneous  packets.  However,  if  the  adversary 
replays  route  requests,  routing  table  information  would 
become  erroneous,  and  old  locations  and  routing  infor- 
mation might  make  nodes  unreachable  [56]. 

•  Selective  misbehaving  attacks:  A  malicious  node  behaves 
badly  but  selectively  to  other  nodes  [66]. 

•  On-off  attacks:  A  malicious  node  may  alternatively  be- 
have well  and  badly  to  stay  undetected  while  disrupting 
services  [66]. 

•  Conflicting  behavior  attacks:  A  malicious  node  may 
behave  differently  to  nodes  in  different  groups  to  make 
the  opinions  from  the  different  good  groups  conflicting, 
and  ultimately  lead  to  non-trusted  relationships  [52]. 

Figure  7  shows  various  attacks  considered  in  a  survey  of 
43  papers.  Note  that  the  "general  selfish"  category  means 
no  specific  information  is  given  in  the  work  except  that  it 
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Fig.  7.  Attacks  considered  in  existing  trust  management  systems  in 
MANETs. 


deals  with  selfish  nodes.  Also  papers  in  the  "general  mis- 
behaving" category  deal  with  a  broad  range  of  misbehaving 
nodes,  including  malicious  and  selfish  nodes,  but  do  not 
provide  detailed  information.  "Packet  related"  attacks  include 
packet  dropping,  packet  modification,  packet  insertion,  and 
selective  packet  forwarding.  "Identity  related"  attacks  include 
impersonation,  masquerading,  and  Sybil  attacks.  Except  for 
the  "general  selfish"  and  "general  misbehaving"  categories, 
we  notice  that  "false  information"  (e.g.,  including  false  rec- 
ommendation or  reputation)  and  "packet  related"  attacks  are 
dominantly  considered  in  the  literature  on  trust  management 
schemes  for  MANETs.  Figure  7  illustrates  that  most  of  the 
attacks  considered  in  the  literature  on  trust  management  are 
general  attacks  often  targeted  at  other  aspects  of  MANETs. 
Hence,  the  trust  evaluation  engine  should  be  robust  and 
degrade  gracefully  if  some  information  or  evidence  does  not 
provide  a  certain  level  of  trust  based  on  partial  or  potentially 
corrupted  information. 

C.  Metrics  for  MANET  Trust  Management 

Although  many  trust  management  schemes  have  been  pro- 
posed to  evaluate  trust  values,  no  work  clearly  addresses  what 
should  be  measured  to  evaluate  network  trust.  Liu  et  al. 
[49]  defined  trust  in  their  model  as  reliability,  timeliness,  and 
integrity  of  message  delivery  to  the  intended  next-hop.  Also 
most  trust-based  protocols  for  secure  routing  calculated  trust 
values  based  on  the  characteristics  of  nodes  behaving  properly 
at  the  network  layer.  Trust  measurement  can  be  application- 
dependent  and  will  be  different  based  on  the  design  goals  of 
proposed  schemes. 

Based  on  31  papers.  Figure  8  shows  various  performance 
metrics  that  have  been  used  to  evaluate  trust  management 
schemes  for  MANETs.  Note  that  a  single  work  may  use 
multiple  performance  metrics.  Figure  8  shows  standard  system 
performance  metrics  typically  used  to  evaluate  trust  manage- 
ment systems;  these  metrics  include  overhead  (e.g.,  control 
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Fig.  8.    Metrics  considered  by  MANET  trust  management  systems. 

packet  overheads),  throughput,  goodput,  packet  dropping  rate, 
and  delay.  "Route  usage"  refers  to  the  number  of  routes 
selected  particularly  when  the  purpose  is  for  secure  routing. 
"Trust  level"  is  a  recently  used  system  metric.  Example 
metrics  using  the  trust  level  include  confidence  level  of 
the  trust  value,  trustworthiness,  opinion  values  about  other 
nodes,  and  trust  level  per  session.  "Others"  indicates  metrics 
that  consider  system  tolerance  based  on  incorrect  reputation 
threshold,  availability,  convergence  time  to  reach  steady  state 
in  trustworthiness  of  all  participating  nodes,  and  percentage 
of  malicious  nodes. 

IV.  MANET  Trust  Management  Schemes 

This  section  summarizes  trust  management  schemes  that 
have  been  developed  for  MANETs. 

We  describe  trust  management  schemes  based  on  specific 
design  purposes  such  as  secure  routing,  authentication,  intru- 
sion detection,  access  control  (authorization),  and  key  manage- 
ment. Further,  we  also  describe  existing  general  frameworks 
for  trust  (or  reputation)  evidence  distribution  and  evaluation. 
Figure  9  summarizes  45  trust  management  schemes  proposed 
for  MANETs  during  2000-2009  based  on  their  design  pur- 
poses. Note  that  under  each  research  category,  we  will  survey 
existing  works  in  chronological  order. 

A.  Secure  Routing 

Most  reputation-based  trust  management  schemes  are  de- 
vised for  collaborative  secure  routing  by  detecting  misbehav- 
ing nodes,  both  selfish  and  malicious  ones.  Marti  et  al.  [67] 
proposed  a  reputation-based  trust  management  scheme  that 
consists  of  a  watchdog  that  monitors  node  behaviors  and  a 
pathrater  that  collects  reputation  and  takes  response  actions 
(e.g.,  isolating  misbehaving  nodes  as  a  result  of  misbehavior 
detection).  This  work  is  an  initiative  to  dynamically  incorpo- 
rate direct  observations  into  trust  values  for  secure  routing.  It 
extends  DSR  (Dynamic  Source  Routing)  but  trust  evaluation 
is  based  only  on  direct  observations. 

Buchegger  et  al.  [68]  initiated  a  new  design  to  develop 
a  routing  protocol  by  introducing  a  "trust  manager"  in  their 
scheme.  They  determined  trust  levels  based  on  self-monitored 


information  while  employing  reputation  collected  from  both 
direct  and  indirect  observations  and  experiences.  They  did  not 
show  any  experimental  results,  but  pose  several  interesting 
questions  such  as  what  is  a  sustainable  relationship  between 
the  total  number  of  nodes  in  the  network,  the  maximum 
number  of  malicious  nodes  the  system  can  tolerate,  and  the 
minimum  number  of  friends  per  node  needed  to  achieve 
high  tolerance,  and  a  prescribed  level  of  trust.  Buchegger  et 
al.  [69]  also  developed  a  reputation-based  trust  management 
scheme  called  CONFIDANT  (Cooperation  Of  Nodes-Fairness 
In  Dynamic  Ad-hoc  NeTworks)  based  on  both  direct  and 
indirect  observations  to  detect  misbehaving  nodes.  The  unique 
feature  in  this  work  is  an  incentive  mechanism  for  altruistic 
nodes  to  be  paid  as  a  result  of  cooperation. 

Paul  and  Westhoff  [70]  proposed  a  context-aware  mecha- 
nism for  detecting  selfish  nodes  by  extending  DSR  with  a 
context-aware  inference  scheme  to  punish  the  accused  and  the 
malicious  accuser.  However,  the  use  of  digital  signatures  to 
disseminate  information  about  the  accused  and  the  malicious 
accuser  may  not  be  viable  in  a  resource-constrained  MANET 
environment. 

Michiardi  et  al.  [71]  proposed  CORE  (Collaborative  REp- 
utation)  that  has  a  monitoring  mechanism  complemented  by 
a  reputation  functionality  that  differentiates  between  direct 
reputation,  indirect  reputation,  and  functional  reputation  (task- 
specific  behavior).  The  proposed  protocol  is  developed  to 
make  decisions  about  cooperation  or  gradual  isolation  of  a 
node.  A  unique  characteristic  of  this  mechanism  is  that  it 
exchanges  only  positive  reputation  information.  However,  this 
may  limit  its  reliance  on  positive  reports  without  the  facility 
to  submit  negative  feedback. 

He  et  al.  [72]  proposed  a  reputation-based  trust  management 
scheme  using  an  incentive  mechanism,  called  SORI  (Secure 
and  Objective  Reputation-based  Incentive).  This  scheme  en- 
courages packet  forwarding  and  discourages  selfish  behav- 
iors based  on  quantified  objective  measures  and  reputation 
propagation  by  a  one-way  hash  chain  based  authentication. 
The  performance  of  this  scheme  in  the  presence  of  malicious 
nodes,  as  may  be  expected  in  a  hostile  environment,  has  not 
been  investigated. 

Nekkanti  and  Lee  [73]  extended  AODV  (Ad  hoc  On- 
demand  Distance  Vector)  using  trust  factor  and  security  level 
at  each  node.  Their  approach  deals  differently  with  each  route 
request  based  on  the  node's  trust  factor  and  security  level. 
In  a  typical  scheme,  routing  information  for  every  request 
would  be  encrypted  leading  to  large  overheads;  they  propose 
to  use  different  levels  of  encryption  based  on  the  trust  factor 
of  a  node,  thus  reducing  overhead.  This  approach  adjusts 
the  security  level  based  on  the  recognized  hostility  level  and 
hence  can  conserve  resources;  however,  the  approach  does  not 
treat  evaluation  of  trust  itself.  Li  et  al.  [74]  also  extended 
AODV  and  adopted  a  trust  model  to  guard  against  malicious 
behaviors  of  nodes  at  the  network  layer.  They  represented 
trust  as  opinion  stemming  from  subjective  logic.  The  opinion 
reflects  the  characteristics  of  trust  in  MANETs,  particularly 
dynamicity.  The  key  feature  is  to  consider  system  performance 
aspects  by  dealing  with  each  query  based  on  its  level  of 
trust.  Depending  on  the  level  of  trust  of  nodes  involved 
in  the  query,  there  is  no  need  for  a  node  to  request  and 
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Fig.  9.    Metrics  considered  by  MANET  trust  management  systems. 


verify  certificates  all  the  time,  thereby  leading  to  significant 
reduction  of  computation  and  communication  overhead.  This 
work  advances  trust  management  by  considering  a  generic 
trust  management  framework  for  MANETs. 

Pisinou  et  al.  [75]  devised  a  secure  AODV-based  routing 
protocol  for  multi-hop  ad  hoc  networks  for  discovering  a 
secure  end-to-end  route  free  of  any  compromised  nodes.  Their 
trust-based  routing  protocol  calculates  trust  values  based  only 
on  direct  observations,  assuming  that  trust  is  transitive.  As 
a  continuation  of  [68],  Buchegger  et  al.  [76]  also  proposed 
a  fully  distributed  reputation  system  in  order  to  cope  with 
false  information  propagation.  The  proposed  design  maintains 
a  reputation  and  trust  rating  system  about  individual  nodes 
by  designing  a  modified  Bayesian  approach.  Recognizing  the 
dynamic  nature  of  trust  and  reputation,  the  authors  introduced 
reevaluation  and  reputation  fading  as  well  as  redemption 
mechanisms.  Nevertheless,  no  other  characteristics  of  trust  are 
addressed  except  for  dynamicity. 

Ghosh  et  al.  [77]  enhanced  trust  management  by  consider- 
ing the  confidence  level  of  trust.  Their  use  of  the  confidence 
level  as  a  weight  on  the  computed  trust  value  and  the  method 
for  calculating  trust  in  a  fully  distributed  way  provide  a 
general  framework  that  can  be  applied  to  non-trust-aware 
routing  protocols.  In  [77],  SORI  [72]  is  extended  to  alleviate 
the  problem  of  selfish  nodes,  by  considering  the  number  of 
forwarding  packets  to  evaluate  the  confidence  level. 

Wang  et  al.  [78]  proposed  a  mechanism  to  distinguish 
selfish  peers  from  cooperative  ones  based  solely  on  local 
observations  of  AODV  routing  protocol  behaviors.  They  use  a 
finite  state  machine  model  of  locally  observed  AODV  actions 
to  construct  a  statistical  description  of  each  peer's  behavior.  In 
order  to  distinguish  between  selfish  and  cooperative  peers,  a 
series  of  well-known  statistical  tests  are  applied  to  features 
obtained  from  the  observed  AODV  actions.  An  interesting 
extension  of  this  work  would  be  to  consider  various  patterns 
of  node  mobility  which  can  give  additional  insights. 

Zouridaki  et  al.  [79]  proposed  a  trust  establishment  mech- 
anism for  MANETs  called  Herms  to  improve  the  reliability 
of  packet  forwarding  over  multi-hop  routes  in  the  presence 
of  potentially  malicious  nodes.  Essentially,  direct  observations 
are  used  to  evaluate  opinions  about  others.  Also,  confidence 
level  is  used  as  a  weight  to  evaluate  trust  of  other  nodes  based 
on  a  Bayesian  approach.  They  also  introduced  a  windowing 
scheme  to  systematically  expire  old  data  to  maintain  accuracy 


of  the  opinion  metric  in  the  face  of  dynamics.  However,  this 
scheme  is  vulnerable  to  attacks  that  can  exploit  the  windowing 
scheme  to  disseminate  false  information  to  accuse  good  nodes 
and  to  keep  bad  nodes  in  the  system  (such  as  badmouthing 
attacks). 

As  an  extension,  Zouridaki  et  al.  [80]  employed  both 
first-hand  trust  information  based  on  direct  observations  and 
second-hand  trust  information  forwarded  from  neighboring 
nodes  about  non-neighboring  nodes.  This  trust  establishment 
scheme  can  cope  with  more  attacks,  including  propagation  of 
false  recommendations  or  information,  identifying  bad  nodes 
among  neighboring  nodes,  colluding  attacks,  replay  attacks, 
and  duplicate  attacks.  It  is  noteworthy  that  they  used  only 
security  related  metrics  to  evaluate  their  scheme,  such  as 
trustworthiness  and  the  percentage  of  nodes  recognized  as  bad. 

Pirzada  et  al.  [81]  proposed  and  examined  the  efficacy 
of  trust-based  reactive  routing  protocols  in  the  presence  of 
attacks.  This  work  only  considers  first  hand  information  to 
evaluate  other  nodes'  trust  values.  Thus,  trust  evaluation  is 
restricted  to  direct  neighboring  nodes. 

Sun  et  al.  [46]  proposed  trust  modeling  and  evaluation 
methods  for  secure  ad  hoc  routing  and  malicious  node  de- 
tection. The  unique  part  of  their  design  is  to  consider  trust  as 
a  measure  of  uncertainty  that  can  be  calculated  using  entropy. 
In  their  definition,  trust  is  a  continuous  variable,  and  does  not 
need  to  be  transitive,  thus  capturing  some  of  the  characteristics 
of  trust  in  MANETs.  However,  this  work  considers  packet 
dropping  as  the  only  component  of  direct  observations  to 
evaluate  trust. 

Abusalah  et  al.  [82]  proposed  a  trust-aware  routing  protocol 
(TARP)  and  developed  a  trust  metric  based  on  six  trust  com- 
ponents including  software  configuration,  hardware  configura- 
tion, battery  power,  credit  history,  exposure  and  organizational 
hierarchy.  However,  no  consideration  was  given  to  trust  decay 
over  time  and  space  to  reflect  uncertainty  due  to  dynamics  and 
incomplete  information  in  MANET  environments. 

Sen  et  al.  [83]  proposed  a  trust-based  mechanism  to  de- 
tect malicious  packet  dropping  nodes  based  on  reputation  of 
neighboring  nodes,  and  take  into  account  the  decay  of  trust 
over  time.  This  work  assumes  that  a  pair  of  public/private  keys 
can  be  preloaded  to  prevent  identity-related  attacks.  However, 
this  may  not  be  scalable  for  a  large  network. 

Soltanali  et  al.  [84]  proposed  a  distributed  mechanism  to 
deal  with  selfish  nodes  as  well  as  to  encourage  cooperation  in 
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MANETs  based  on  the  combination  of  reputation-based  and 
currency-based  incentive  mechanism  mitigating  their  defects 
and  improving  their  advantages.  Compared  to  existing  works, 
this  work  considers  more  aspects  of  trust  such  as  dynamicity, 
weighted  transitivity,  and  subjectivity.  However,  it  used  only 
packet  forwarding  behaviors  to  evaluate  a  node's  trust  and 
standard  performance  metrics  to  evaluate  the  proposed  trust 
scheme. 

Balakrishnnan  et  al.  [85]  developed  a  trust  model  to 
strengthen  the  security  of  MANETs  and  to  deal  with  the  issues 
associated  with  recommendations.  Their  model  utilizes  only 
trusted  routes  for  communication,  and  isolates  malicious  nodes 
based  on  the  evidence  obtained  from  direct  interactions  and 
recommendations.  Their  protocol  is  described  as  robust  to  the 
recommender's  bias,  honest-elicitation,  and  free-riding.  This 
work  uniquely  considered  a  context-dependency  characteristic 
of  trust  in  extending  DSR. 

Li  et  al.  [52]  stated  that  using  only  a  reputation-based  trust 
framework  gives  only  an  incomplete  partial  solution  for  trust 
management.  They  proposed  an  objective  trust  management 
framework  (OTMF)  for  MANETs  based  on  both  direct  and 
indirect  information  for  reputation  management  and  showed 
the  effectiveness  of  OTMF.  This  work  used  the  term  "ob- 
jective" trust  to  refer  to  trust  evaluated  based  on  second- 
hand information.  However,  this  work  did  not  consider  node 
collusion  in  obtaining  second-hand  information,  which  may 
lead  to  incorrect  recommendations. 

Mundinger  and  Boudec  [86]  were  the  first  to  analyze  the 
robustness  of  a  reputation  system  based  on  a  deviation  test. 
Using  a  mean-field  approach  in  their  stochastic  model,  they 
showed  that  liars  have  no  impact  unless  the  number  of  liars 
exceeds  a  certain  threshold  (a  phase  transition).  They  provided 
precise  formulas  for  the  critical  values  and  guidelines  for  an 
optimal  choice  of  parameters.  This  work  is  unique  in  that  it 
evaluates  a  system's  tolerance  to  untrusted  nodes;  however,  the 
reputation  evaluation  is  based  only  on  the  "fake"  information. 

Moe  et  al.  [87]  proposed  a  trust-based  routing  protocol  as 
an  extension  of  DSR  based  on  an  incentive  mechanism  that 
enforces  cooperation  among  nodes  and  reduces  the  benefits 
that  selfish  nodes  can  enjoy  (e.g.,  saving  resources  by  selec- 
tively dropping  packets).  This  work  is  unique  in  that  they  used 
a  hidden  Markov  model  (HMM)  to  quantitatively  measure  the 
trustworthiness  of  nodes.  In  this  work,  selfish  nodes  are  benign 
and  selectively  drop  packets.  Performance  characteristics  of 
the  protocol  when  malicious  nodes  perform  active  attacks 
such  as  packet  modifications,  identity  attacks,  etc.,  need  to 
be  investigated  further. 

In  quorum  or  threshold  schemes,  a  node  must  successfully 
interact  with  at  least  k  of  n  distributed  trusted  authority  (TA) 
nodes.  Finding  k  such  nodes  can  be  resource  intensive.  Reidt 
et  al.  [88]  prioritize  the  TA  nodes  and  find  a  route  to  connect 
to  k  desirable  TA  nodes  so  as  to  minimize  a  performance 
metric  such  as  overhead,  taking  into  account  reliability  and 
energy  consumption  of  individual  nodes.  Significant  savings 
over  a  standard  system  were  shown.  An  interesting  aspect,  not 
considered  yet,  would  be  to  incorporate  trustworthiness  into 
the  TA  selection  and  routing  scheme. 

Ayachi  et  al.  [89]  formalized  implicit  trust  relations  in 
AODV  and  demonstrated  that  a  node  can  utilize  these  trust 


relations  to  isolate  malicious  nodes  for  secure  routing.  Nodes 
overhear  neighbors'  transmissions  from  which  they  can  build 
a  neighbor  routing  table  and  check  for  deviation  from  nor- 
mal behaviors  for  AODV.  This  scheme  can  detect  malicious 
behaviors  such  as  message  replication,  message  forgery  and 
some  instances  of  message  modification.  However,  it  is  not 
amenable  to  incorporation  of  other  trust  metric  components, 
such  as  intimacy  and  competence  but  monitored  behaviors 
could  feed  into  a  trust  evaluation  scheme. 

Adnane  et  al.  [90]  proposed  trust-based  countermeasures 
to  isolate  malicious  nodes  extending  OLSR  (Optimized  Link 
State  Routing).  Their  protocol  provides  secure  routing  paths 
by  identifying  malicious  nodes.  The  focus  of  the  protocol  is 
to  prevent  usurpation  of  node  identities.  Performance  analysis 
under  other  types  of  attacks  remains  to  be  investigated. 

Although  many  researchers  have  developed  secure  routing 
protocols  using  trust,  most  of  the  approaches  have  focused  on 
monitoring  routing  behaviors  and  the  evaluation  of  trust  has 
been  in  the  context  of  communication  networks.  Further  steps 
should  be  taken  to  refine  issues  such  as  (1)  how  to  quantify 
trust  in  a  MANET  node;  (2)  how  to  employ  (a  continuous- 
valued)  trust  in  a  routing  decision;  and  (3)  how  to  develop  a 
composite  trust  metric  incorporating  task  performance  goals, 
taking  into  account  the  social  aspects  of  a  MANET  node. 

B.  Authentication 

There  have  been  efforts  to  establish  trust  relationships  to 
ensure  authentication  in  MANETs.  Weimerskirch  et  al.  [91] 
developed  a  trust  model  based  on  human  behavior,  noting  that 
society  can  be  properly  considered  as  an  ad  hoc  network. 
They  used  recommendations  from  a  distributed  trust  model  to 
construct  trust  relationships  and  extended  it  by  a  request  for 
recommendations.  Based  on  models  derived  from  observations 
of  human  society,  recommendations  are  used  to  calculate  trust, 
with  weights  based  on  the  distance  of  relationships.  Their 
definition  does  not  assume  symmetry  or  complete  transitivity, 
thus  capturing  essential  features  of  trust  in  MANETs.  The 
assumption  of  low-value  transactions  does  not  require  any 
evidence-based  mechanism  to  ensure  trust  such  as  authen- 
tications using  public/private  keys.  Consequently,  it  is  not 
applicable  to  systems  where  hostility  may  be  high,  or  where 
consequences  of  misplaced  trust  can  be  severe. 

Verma  et  al.  [92]  presented  an  overview  of  a  trust  negoti- 
ation scheme  using  DSR  and  ZRP  (Zone  Routing  Protocol). 
Their  scheme  consists  of  two  components.  The  peer-to-peer 
component  deals  with  secure  communications  with  neighbors 
in  a  lightweight  manner.  The  heavyweight  remote  component 
performs  trust  negotiation  and  establishes  secure  end-to-end 
communication.  The  main  goal  of  this  work  is  to  add  ro- 
bustness in  the  process  of  trust  negotiation,  rather  than  trust 
evaluation. 

Pirzada  and  McDonald  [93]  proposed  a  trust-based  com- 
munication model  that,  based  on  a  notion  of  a  belief,  pro- 
vides a  dynamic  measure  of  reliability  and  trustworthiness 
in  MANETs.  The  merit  of  this  work  is  to  incorporate  utility 
as  general  trust  and  time  as  situational  trust  into  the  overall 
trust  metric  to  evaluate  an  agent  in  the  network.  However,  the 
situational  trust  considered  is  limited  to  monitoring  dynamics 
of  packet  forwarding  behaviors. 
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Davis  [47]  proposed  a  reliable  and  structured  hierarchical 
model  for  trust  management  in  MANETs  that  is  robust  to 
malicious  accusation  exploits.  The  scheme  deals  with  explicit 
revocation  of  certificates  in  a  distributed  way,  eliminating  the 
case  in  which  revoked  certificates  can  be  accepted  as  valid., 
This  work  assumes  that  the  initial  certificates  and  public  keys 
of  all  nodes  are  distributed  by  a  centralized  trust  authority  to 
each  node  before  the  network  is  deployed  which  may  not  be 
scalable  in  a  large  scale  MANET.  The  paper  does  not  discuss 
the  issue  of  false  positives  which  can  lead  to  continual  eviction 
of  nodes,  and  eventually  loss  of  network  connectivity.  To 
counteract  this,  dynamic  reissue  of  certificates  may  be  needed 
which  may  incur  extra  overhead. 

Ngai  and  Lyu  [94]  proposed  a  secure  public  key  authentica- 
tion service  based  on  their  trust  model  to  prevent  propagation 
of  false  public  keys  in  the  presence  of  malicious  nodes. 
Trust  is  evaluated  based  on  direct  monitoring  as  well  as 
recommendation.  However,  this  work  does  not  consider  group 
membership  changes,  the  distance  from  the  evaluator,  and  their 
effect  on  the  performance  of  their  trust  management  scheme. 

In  summary,  there  has  been  quite  a  bit  of  work  on  using 
trust  for  authentication.  However,  as  in  the  case  of  trust-based 
secure  routing,  the  models  and  protocols  used  are  based  solely 
on  monitoring  packet  forwarding  behaviors. 

C.  Intrusion  Detection 

Trust  can  be  used  as  a  basis  for  developing  an  intrusion 
detection  system  (IDS).  Also,  IDS  itself  can  help  nodes 
measure  trust  of  other  nodes  when  they  cooperate  with  each 
other  to  detect  malicious  nodes.  Albers  et  al.  [95]  proposed 
a  general  architecture  for  an  intrusion  detection  system  (IDS) 
called  a  Local  IDS  (LIDS)  such  that  intrusion  detection  can 
be  performed  locally  among  trustworthy  participating  nodes. 
Here,  trust  is  used  to  detect  intrusions  in  the  system.  In  Ahmed 
et  al.  [96],  IDS  provides  audit  and  monitoring  capabilities 
that  offer  local  security  to  a  node  and  helps  perceive  the 
specific  trust  levels  of  other  nodes.  Hence,  evaluating  trust  and 
identifying  intrusions  may  not  be  totally  separated  processes. 

D.  Access  Control 

Trust  also  can  be  applied  in  determining  whether  or  not  to 
grant  access  to  certain  resources  or  rights.  Gray  et  al.  [97] 
integrated  trust-based  admission  control  with  standard  role- 
based  access  control.  By  doing  this,  an  access  control  decision 
is  effectively  made  without  being  affected  by  incomplete  infor- 
mation collected  in  MANETs.  A  simple  distributed  blackjack 
card  game  application  is  described,  in  which  the  trust-based 
admission  control  system  is  used  to  assign  roles  to  users  based 
on  their  trust-based  admission  rights.  It  is  not  clear  how  the 
approach  can  be  extended  to  a  general  framework  applicable 
to  MANETs. 

Luo  et  al.  [98]  presented  a  ubiquitous  and  robust  access 
control  solution  (URSA)  for  MANETs  based  on  a  localized 
group  trust  model  so  that  only  well  behaving  nodes  will  have 
access  rights  to  network  resources.  Their  localized  group  trust 
model  for  MANETs  is  based  on  threshold  cryptography:  a 
node  is  globally  trusted  only  if  it  is  individually  trusted  by 
any  k  trusted  nodes  where  k  is  a  system-wide  trust  threshold. 


This  work  assumes  that  the  node  density  is  large  enough  so 
that  any  node  can  find  k  trusted  nodes,  perhaps  by  moving 
to  another  location.  Interesting  extensions  of  the  work  include 
consideration  of  mobility  models  other  than  random  waypoint, 
and  trust  evaluation  under  high  node  mobility  situations. 

Adams  and  Davis  [17]  presented  a  decentralized  access 
control  system  implementing  sociological  trust  constructs  in 
a  quantitative  system  to  evaluate  the  relationships  between 
entities.  A  distributed,  node-centric  approach  to  reputation 
management  considers  a  node's  behavior  feedback  and  gives  a 
reputation  index  that  nodes  can  use  to  determine  the  trustwor- 
thiness of  their  peers  before  establishing  trust  relationships. 
This  work  further  assessed  risk  using  a  Bayesian  approach 
to  evaluate  trust.  Interestingly,  this  work  used  reputation  as 
a  weight  to  evaluate  direct  observations,  which  is  a  different 
approach  from  most  existing  works.  Extensions  of  the  scheme 
to  handle  network  dynamics  would  be  useful. 

Yunfang  [53]  proposed  an  integrated  mechanism  of  policy 
proof  and  reputation  evolution  into  trust  management  for 
decision-making  on  access  control  with  the  goal  of  providing 
firm/objective  security  as  well  as  social/subjective  security. 
However,  this  work  is  based  on  the  assumption  that  trust 
is  completely  transitive,  and  it  is  not  clear  how  a  more 
realistic  transitivity  model  can  be  incorporated  into  the  trust 
management  system. 

E.  Key  Management 

Virendra  et  al.  [99]  proposed  a  trust-based  security  archi- 
tecture for  key  management  in  MANETs.  This  architecture 
aims  to  establish  keys  between  nodes  based  on  their  trust 
relationships,  and  to  build  secure  distributed  control  using  trust 
as  a  metric.  In  their  self-organizing  trust-based  architecture, 
nodes  are  organized  into  trust-based  clusters  called  Physical- 
Logical  Trust  Domains  (PLTDs),  a  group  of  trusted  nodes 
sharing  a  group  key.  Nodes  can  belong  to  multiple  PLTDs. 
The  unique  part  of  this  work  is  that  it  considers  the  trust  level 
of  each  node  in  a  physical  as  well  as  a  logical  sense,  e.g., 
it  considers  both  one-hop  nodes  as  well  as  previously  trusted 
nodes  that  are  not  currently  one-hop  neighboring  nodes.  The 
significant  merit  of  this  work  is  in  formalizing  a  trust  metric 
reflecting  trust  decay  over  time  and  updating  trust  as  dynamics 
of  the  network  change.  However,  establishing  pair- wise  keys 
based  on  pair-wise  trust  may  not  be  feasible  in  terms  of 
scalability  and  in  the  presence  of  high  network  dynamics  in  a 
large  MANET. 

Hadjichristofi  et  al.  [63]  presented  a  key  management 
framework  that  provides  redundancy  and  robustness  in  the 
establishment  of  Security  Association  (SA)  between  pairs  of 
nodes.  Their  proposed  key  management  system  (KMS)  adopts 
a  modified  hierarchical  Public  Key  Infrastructure  (PKI)  model 
where  nodes  can  dynamically  take  management  roles.  The 
scheme  is  designed  to  provide  high  service  availability  based 
on  trust-based  SA  among  nodes.  However,  trust  relationships 
are  derived  solely  from  certificate  chains.  Adams  et  al.  [44] 
also  extended  their  prior  work  [63]  with  a  node-centric  rep- 
utation management  approach  that  considers  feedback  about 
a  node's  behavior  in  generating  a  reputation  index  to  de- 
termine the  trustworthiness  of  its  peers  before  establishing 


202 


IPSec  security  associations.  They  considered  the  decay  of  trust 
over  time  using  a  three- window  weighted  average.  They  also 
derived  reputation  values  from  past  experiences  and  current 
observations  and  introduced  a  rehabilitation  mechanism  to 
give  a  second  chance  to  bad  nodes.  However,  no  details 
were  given  on  the  type  of  information  that  should  be  directly 
observed  to  derive  reputation. 

Li  et  al.  [100]  demonstrated  an  on-demand,  fully  local- 
ized, and  hop-by-hop  public  key  management  protocol  for 
MANETs.  In  this  protocol  each  node  generates  its  own 
public/private  key  pairs,  issues  its  certificate  to  neighboring 
nodes,  keeps  received  certificates  in  its  certificate  repository, 
and  provides  authentication  service  by  adapting  to  the  dynamic 
network  topology,  without  reliance  on  any  centralized  server. 
However,  only  certificate  chains  are  used  to  derive  trust. 

Chang  and  Kou  [101]  proposed  a  Markov  chain  trust  model 
to  obtain  the  trust  values  (TVs)  for  1-hop  neighbors.  They  de- 
signed a  trust-based  hierarchical  key  management  scheme  by 
selecting  a  certificate  authority  server  (CA)  and  a  backup  CA 
with  the  highest  TVs.  This  work  gives  a  rigorous  analysis  of 
TVs  and  considers  a  variety  of  attacks.  However,  it  computes 
TVs  only  based  on  direct  observations  and  does  not  consider 
trust  decay  due  to  using  recommendations  from  remote  nodes. 

A  survey  of  key  management  techniques  for  network-layer 
security  may  be  found  in  the  work  by  Hegland  et  al.  [102]. 

In  contrast  to  secure  routing  that  produces  an  operational 
MANET,  authentication,  intrusion  detection,  access  control, 
and  key  management  are  general  trust  contexts  that  also  exist 
outside  the  area  of  MANETs.  In  these  applications,  it  is 
useful  to  abstract  out  the  properties  of  MANETs  and  consider 
only  the  influence  of  MANETs  on  any  information/evidence 
gathering,  aggregation,  and  other  computation,  and  design  a 
trust  management  scheme  that  considers  influences  such  as 
the  cost/likelihood  of  obtaining  a  piece  of  information  in 
computing  trust. 

F.  Trust  Evidence  Distribution  and  Evaluation 

Several  trust  management  schemes  have  been  proposed 
in  order  to  provide  a  general  framework  for  trust  evidence 
distribution  or  evaluation  in  MANETs. 

Yan  et  al.  [64]  proposed  a  trust  evaluation  based  security 
solution  for  data  protection,  secure  routing,  and  other  network 
activities.  This  trust  evaluation  model  called  Personal  Trusted 
Bubble  (PTB)  considers  many  factors  including  experience 
statistics,  data  value  (the  higher  the  value  of  the  data,  the 
higher  is  the  trust  needed  from  other  PTEs  to  transfer  it), 
intrusion  black  list,  reference  (reputation/recommendation), 
personal  preference,  and  PTB  policy  (related  to  the  entire 
network's  security  requirements  and  policy).  Interestingly,  per- 
sonal preference  and  PTB  reflect  the  subjective  characteristic 
of  trust  in  deriving  trust  values.  Yan  et  al.  [64]  do  not 
validate  whether  their  proposed  trust  management  is  correct 
or  useful  compared  to  the  actual  trust  levels,  say,  based  on 
trustworthiness  in  Josang  and  Solhaug's  terminology.  In  gen- 
eral, validation  of  trust  models  is  difficult,  given  the  inherent 
subjectivity  in  the  trust  metric,  but  it  is  also  critical.  Jiang 
and  Baras  [103]  proposed  a  trust  distribution  scheme  called 
ABED  (Ant-Based  trust  Evidence  Distribution)  based  on  the 
swarm  intelligence  paradigm,  which  is  highly  distributed  and 


adaptive  to  mobility.  The  swarm  intelligence  paradigm  is 
widely  used  in  dynamic  optimization  problems  (e.g.,  the  trav- 
eling salesman  problem,  routing  in  communication  networks). 
The  key  principle  in  swarm  interaction  is  called  stigmergy, 
indirect  communication  through  the  environment.  In  ABED, 
"pheromones"  are  deposited  at  nodes  by  mobile  agents  called 
"ants"  and  provide  the  mechanism  for  information  exchange 
and  interactions.  These  "ants"  can  identify  the  optimal  path 
toward  their  food,  resembling  trust  evidence  in  this  case. 
The  pheromone  regulation  process  is  known  to  be  suitable 
for  dynamically  changing  environments  such  as  MANETs. 
However,  no  specific  attackers  are  considered  to  prove  the 
robustness  of  the  proposed  scheme  in  the  presence  of  attacks. 

In  the  continuing  work,  Baras  and  Jiang  [104]  addressed 
distributed  trust  computation  and  establishment  using  random 
graph  theory.  This  work  uses  the  theory  of  dynamic  coop- 
erative games  and  identifies  how  a  phase  transition  from  a 
distrusted  state  to  a  trusted  state  can  occur  in  a  dynamic 
MANET.  This  work  is  unique  in  that  it  describes  how  phase 
transitions  occur  in  MANETs  and  how  these  are  related  to 
node  mobility  and  network  topology  in  the  process  of  initial 
trust  establishment.  Trust  relationships  are  ternary  (yes,  no, 
don't  care)  and  the  emphasis  is  on  understanding  steady- state 
behaviors.  Incorporating  continuous  valued  trust  variables, 
dynamics,  and  transient  behaviors  in  this  framework  would 
be  useful. 

Theodorakopoulos  and  Baras  [50]  proposed  a  trust  evidence 
evaluation  scheme  for  MANETs.  The  evaluation  process  is 
modeled  as  a  path  problem  in  a  directed  graph  where  vertices 
represent  entities  and  edges  represent  trust  relations.  The 
authors  employed  the  theory  of  Semirings  to  show  how  two 
nodes  can  establish  trust  relationships  without  prior  direct 
interactions.  Their  case  study  uses  the  PGP  web  of  trust 
to  express  an  example  trust  model  based  on  Semirings  and 
shows  that  their  scheme  is  robust  in  the  presence  of  attackers. 
However,  their  work  assumes  that  trust  is  transitive.  Further, 
trust  and  confidence  values  are  represented  as  binary  rather 
than  continuous  values.  Even  though  no  centralized  trusted 
third  party  exists,  their  work  makes  use  of  a  source  node 
as  a  trusted  infrastructure,  which  introduces  vulnerability  in 
MANETs. 

Recently,  Boukerche  and  Ren  [105]  proposed  a  distributed 
reputation  management  mechanism  called  GRE  (Generalized 
Reputation  Evaluation),  using  a  comprehensive  computational 
reputation  model.  GRE  seeks  to  prevent  malicious  nodes  from 
entering  a  trusted  community.  However,  no  specific  attack 
model  was  addressed. 

Moloney  and  Weber  [106]  presented  a  trust-based  security 
system  that  generates  appropriate  trust  levels  based  on  the 
consideration  of  the  main  characteristics  of  MANETs  as 
well  as  context- awareness.  The  scheme  leverages  two  existing 
projects  at  Trinity  College,  Dublin,  called  SECURE  and  Aithe. 
SECURE  is  used  for  trust  management  using  a  trust  engine 
and  a  risk  engine  while  Aithe  collects  and  manages  context 
information  forwarded  from  sensors.  It  is  worthwhile  to  extend 
this  work  to  consider  attacks  that  can  propagate  incorrect 
information  to  generate  trust  levels. 

Very  recently,  Cho  et  al.  [107]  proposed  a  trust  management 
scheme  for  group  communication  systems  in  MANETs.  This 
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work  proposed  a  composite  trust  metric  reflecting  various 
aspects  of  a  MANET  node  such  as  sociability  (i.e.,  social 
trust)  and  task  performance  capability  (i.e.,  QoS  trust),  and 
investigated  the  effect  of  the  trust  chain  length  used  by  a 
node  to  establish  acceptable  trust  levels  through  subjective 
trust  evaluation.  They  also  discussed  the  concept  of  objective 
trust  evaluation  based  on  global  knowledge  as  the  basis  of 
validating  subjective  trust  evaluation.  More  work  remains  to 
be  done  to  ascertain  feasibility. 

The  Appendix  summarizes  trust  management  schemes  sur- 
veyed in  this  section.  In  the  Appendix,  the  methodology 
explains  how  trust  evidence  is  collected  and  performance 
metrics  refer  to  the  metrics  used  to  evaluate  various  trust 
management  schemes. 

V.  Future  Research  Discussion 

It  is  clear  that  sooner  or  later  intelligence  will  be  em- 
bedded in  each  node  with  cognitive  functionality,  adopting 
recent  ideas  about  cognitive  networks  in  wireless  networks 
[108].  Mahmoud  [108]  defines  a  cognitive  network  as  having 
a  cognitive  process  that  is  capable  of  perceiving  current 
network  conditions  and  then  planning,  deciding,  and  acting 
on  those  conditions.  Cognitive  networks  are  able  to  recon- 
figure the  network  infrastructure  based  on  past  experiences 
by  adapting  to  continuously  changing  network  behaviors  to 
improve  scalability  (e.g.,  reducing  complexity),  survivability 
(e.g.,  increasing  reliability),  and  QoS  (e.g.,  facilitating  co- 
operation among  nodes)  as  proactive  mechanisms  [48]  [108]. 
We  suggest  using  this  concept  of  cognitive  networks  so  that 
nodes  can  adapt  to  changing  network  behaviors,  such  as 
attacker  behaviors,  degree  of  hostility,  node  disconnection  due 
to  physical  environment  such  as  terrain,  energy  depletion,  or 
voluntary  disconnection  for  energy  saving.  Cognition  is  more 
than  adaptation;  it  incorporates  learning  and  reasoning. 

Another  potentially  fruitful  research  direction  is  to  use 
social  relationships  in  evaluating  trust  among  collaborators  in 
a  group  setting  by  employing  the  concept  of  social  networks. 
Golbeck  et  al.  [37]  [38]  [39]  define  a  social  network  as  a 
social  structure  of  individuals  who  may  be  related  directly  or 
indirectly  to  each  other  in  order  to  pursue  common  interests. 
Yu  et  al.  [109]  and  Maheswaran  et  al.  [110]  use  social 
networks  to  evaluate  the  trust  value  of  a  node.  Examples 
of  social  networks  are  strong  social  relationships  including 
colleagues  or  relatives,  membership  in  the  same  platoon,  and 
loose  social  relationships  including  school  alumni  or  friends 
with  common  interests  or  membership  in  coalition  activities. 
Social  trust  may  include  friendship,  honesty,  privacy,  and 
social  reputation  or  recommendation  derived  from  direct  or 
indirect  interactions  for  "sociable"  purposes.  In  MANETs, 
metrics  used  to  measure  these  social  trust  properties  can  be 
frequency  of  communications,  malicious  or  benign  behaviors 
(e.g.,  false  accusation  or  recommendation,  impersonation), 
private  information  revealed,  and  quality  of  reputation.  The 
notion  of  social  trust  is  being  incorporated  into  communication 
networks.  Trust  propagation  models,  some  based  on  notions  of 
social  networking,  have  been  proposed  in  multi-agent  systems 
[114]  [115]  [116]. 

An  important  and  interesting  research  direction  is  to  con- 
struct a  composite  trust  metric  based  on  social  trust  and 


other  trust  components  representing  quality-of- service  (QoS) 
to  successfully  perform  tasks  to  meet  both  performance  and 
trust  requirements.  We  have  seen  some  work  in  the  literature 
moving  in  this  direction.  Cho  et  al.  considered  honesty  and 
intimacy  (for  social  trust),  and  unselfishness  and  energy  (for 
QoS  trust)  for  trust  evaluation  [107].  Kohlas  et  al.  [Ill]  used 
honesty,  competency,  reliability,  and  maliciousness  and  their 
corresponding  negations  as  trust  components  to  define  trust 
relationships.  Yin  et  al.  [112]  computed  composite  reputation 
values  of  peers  based  on  evidences  from  various  domains 
such  as  customers'  reputation  scores  or  ranks  in  commercial 
sites  or  the  certified  roles  in  certain  organizations  with  dif- 
ferent weights  indicating  the  importance  and  robustness  of 
the  reputation  computation  processes.  Boursas  and  Hommel 
[113]  considered  QoS  aspects  such  as  the  visual  quality  in 
multimedia  and  commitment  in  interactions  to  calculate  node 
trust  levels  in  large  distributed  systems.  More  work  remains 
to  be  done  to  understand  the  best  combination  of  social  trust 
versus  QoS  trust  components  used  to  construct  the  composite 
trust  metric,  as  well  as  the  best  weights  associated  with  social 
trust  and  QoS  trust,  especially  when  given  application  context 
information  for  critical  mission  executions  in  MANETs. 

Not  much  work  has  been  done  in  trust  management  for 
mobile  vehicular  systems.  A  trust  architecture  for  vehicular 
networks  is  proposed  in  [117]  that  incorporates  a  policy 
control  model,  a  proactive  trust  model,  and  a  social  network 
based  system,  and  takes  into  account  dynamics.  When  the 
environment  is  volatile,  associating  trust  with  data  becomes 
even  more  challenging;  a  solution  is  provided  in  [117]  and  a 
case  study  is  discussed  in  the  context  of  vehicular  networks. 

The  overall  qualities  of  trust  in  decision  making  de- 
pends on  complex  interactions  between  the  information,  so- 
cial/cognitive, and  communications  networks.  Trust  metrics 
might  be  separately  defined  in  each  of  the  networks,  but 
the  key  issue  is  to  elucidate  the  mapping  of  qualitative  and 
quantitative  metrics  across  the  networks,  to  define  an  end- 
to-end  notion  of  composite  trust,  to  determine  the  attributes 
(presumably  many  others  than  trust)  in  the  different  networks 
that  affect  this  composite  metric,  and  identify  those  that  can 
be  controlled  and  those  that  cannot  [118],  especially  for  trust 
management  in  a  coalition  environment  [119]. 

We  suggest  that  the  following  design  concepts  be  considered 
for  building  MANET  trust  management  systems: 

•  A  trust  metric  must  reflect  the  unique  properties  of  trust 
in  MANETs,  including  possibly  imperfect  transitivity, 
asymmetry,  subjectivity,  non-binary  nature,  decay  over 
time  and  space,  dynamicity,  and  context-dependency. 

•  A  trust  metric  must  incorporate  adequate  trust  compo- 
nents (e.g.,  social  trust  and  QoS  trust)  capable  of  reflect- 
ing mission  difficulty  (e.g.,  high  risk  upon  task  failure), 
changing  network  environments  (e.g.,  lack  of  bandwidth, 
increasingly  hostile  environment  as  attackers'  strength 
increases,  high  communication  load),  and  conditions  of 
participating  nodes  (e.g.,  low  energy,  compromised  sta- 
tus). 

•  A  trust  management  design  must  support  cognitive  func- 
tionality for  each  node  to  achieve  adaptability  to  changing 
network  conditions  and  MANET  environments  including 
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node  density,  node  mobility  patterns,  scheduling  algo- 
rithms, and  traffic  patterns. 

•  A  trust  management  system  should  be  situation  specific 
or  situation  aware  [120]  [121]  [122].  Situational  awareness 
includes  mission  contexts  and  requirements  in  terms  of 
security,  performance  and  reliability.  Depending  on  the 
required  levels  of  security,  performance  and/or  reliability, 
a  different  level  of  trust  can  be  adopted  reflecting  mission 
contexts  and  situations. 

•  A  trust  metric  must  adequately  reflect  tradeoffs  in  altru- 
ism versus  selfishness,  trust  versus  reliability,  availability, 
survivability,  or  security  so  as  to  contribute  to  improved 
system  performance.  In  addition,  since  gathering  infor- 
mation from  spatially  remote  areas  will  consume  more 
resources  (e.g.,  time  or  energy)  but  improve  decision 
making,  one  should  investigate  the  tradeoff  between 
resource  consumption  and  decision  making  accuracy  and 
timeliness.  One  may  utilize  aggregation  technique  to 
reduce  resource  consumption  in  obtaining  information 
from  distant  nodes. 

•  A  trust  management  design  must  allow  optimal  settings 
to  be  identified  under  various  network  and  environmental 
conditions  so  as  to  maximize  the  overall  trust  of  the  sys- 
tem for  successful  mission  executions.  Equally  important 
is  an  understanding  of  sensitivity  to  deviations  from  the 
optimal  settings. 

•  There  has  been  no  comparison  of  trust  management 
schemes  versus  conventional  security  schemes  in  terms 
of  metrics  of  interest  in  MANETs.  One  example  could 
be  the  comparison  of  trust  management  schemes  to 
cryptographic  schemes  in  detecting  misbehaving  nodes. 

•  Local  trust  is  easy  to  understand  and  compute,  since  it 
only  involves  tracking  behaviors  of  neighboring  nodes. 
Local  trust  is  easy  to  defend  from  malicious  attacks. 
Global  trust  is  harder  to  compute  and  update;  Eigentrust 
[123]  is  an  example  of  a  global  trust  metric.  But  a 
non-local  definition  of  trust  is  subject  to  subversion  and 
manipulation  by  colluding  nodes.  Zhang  et  al.  [124] 
provide  a  robust  version  of  the  Eigentrust  algorithm.  A 
critical  question  is:  is  trust  inherently  local?  How  can  a 
global  trust  metric  be  computed  and  distributed  reliably? 

•  Recently,  social  trust  derived  from  social  networks  has 
received  considerable  attention  for  establishing  trust  in 
various  applications.  MANET  designers  may  also  want 
to  take  into  account  social  trust. 

•  The  survey  has  focused  on  a  trust  value  associated  with 
individual  nodes.  But  often  we  may  be  interested  in 
associating  trust  with  data  or  with  a  group  of  nodes  or 
entities.  Many  of  the  concepts  discussed  here  will  extend 
naturally. 

VL  Concluding  Remarks 

Trust  is  a  multidimensional,  complex,  and  context- 
dependent  concept.  Although  trust-based  decision  making  is 
in  our  everyday  life,  trust  establishment  and  management 
in  MANETs  face  challenges  due  to  the  severe  resource 
constraints,  the  open  nature  of  the  wireless  medium,  the 
complex  dependence  between  the  communications,  social  and 
application  networks,  and,  hence,  the  complex  dependency 


of  any  trust  metric  on  features,  parameters,  and  interactions 
within  and  amongst  these  networks. 

In  this  paper,  we  surveyed  and  analyzed  existing  trust 
management  schemes  in  MANETs  to  provide  MANET  trust 
network  protocol  designers  with  multiple  perspectives  on  the 
concept  of  trust,  an  understanding  of  trust  properties  that 
should  be  observed  in  developing  trust  metrics  for  evaluating 
trust,  and  insights  on  how  a  trust  metric  can  be  customized 
to  meet  the  requirements  and  goals  of  the  targeted  system.  A 
composite  trust  metric  that  captures  aspects  of  communica- 
tions and  social  networks,  and  corresponding  trust  measure- 
ment, trust  distribution,  and  trust  management  schemes  are 
interesting  research  directions.  For  dynamic  networks,  such 
as  military  MANETs,  these  schemes  should  have  desirable 
attributes  such  as  ability  to  adapt  to  environmental  dynamics, 
scalability,  reliability,  and  reconfigurability. 

Acknowledgment 

This  project  is  supported  in  part  by  an  appointment  to  the 
U.S.  Army  Research  Laboratory  Postdoctoral  Fellowship  Pro- 
gram administered  by  the  Oak  Ridge  Associated  Universities 
through  a  contract  with  the  U.S.  Army  Research  Laboratory. 
The  authors  appreciate  the  many  discussions  with  and  critical 
insights  provided  by  our  internal  ARL  staff  including  Eliza- 
beth Bowman,  Kevin  Chan,  Natalie  Ivanic,  and  Brian  Rivera. 
The  authors  also  give  special  thanks  to  Dakshi  Agrawal  and 
Mudhakar  Srivatsa  from  IBM  T.  J.  Watson  Research  Center 
for  their  valuable  comments. 

Appendix 

a  survey  on  existing  trust  management  schemes 
IN  MANETs 

See  Tables  I- VI. 

References 

[1]  S.  Corson  and  J.  Macker,  "Mobile  Ad  Hoc  Networking  (MANET): 
Routing  Protocol  Performance  Issues  and  Evaluation  Considerations," 
RFC  2501,  Jan.  1999. 

[2]  J.  Jubin  and  J.  Tornow,  "The  DARPA  Packet  Radio  Network  Protocols," 
Proc.  IEEE,  vol.  75,  no.  1,  Jan.  1987,  pp.  21-32. 

[3]  A.  J.  Tardiff  and  J.W.  Gowens,  Editors,  "ARL  Advanced  Telecom- 
munication and  Information  Distribution  Research  Program  (ATIRP)," 
Final  Report,  1996-2001,  June  2001. 

[4]  T.  Plesse,  J.  Lecomte,  C.  Adjih,  M.  Badel,  P.  Jacquet,  A.  Laouiti,  P. 
Minet,  P.  Muhlethaler,  and  A.  Plakoo,  "OLSR  Performance  Measure- 
ment in  a  Military  Mobile  Ad  Hoc  Network,"  Proc.  24th  Int'l  Conf. 
on  Distributed  Computing  Systems,  2004,  pp.  704-709. 

[5]  K.  S.  Cook  (editor).  Trust  in  Society,  vol.  2,  Feb.  2003,  Russell  Sage 
Foundation  Series  on  Trust,  New  York. 

[6]  M.  Blaze,  J.  Feigenbaum,  and  J.  Lacy,  "Decentralized  Trust  Manage- 
ment," Proc.  IEEE  Symposium  on  Security  and  Privacy,  6-8  May,  1996, 
pp.  164  -  173. 

[7]  R.  B.  Bobba,  L.  Eschenauer,  V.  Gligor,  and  W.  Arbaugh,  "Bootstrap- 
ping Security  Associations  for  Routing  in  Mobile  Ad  Hoc  Networks," 
Proc.  IEEE  GLOBECOM,  San  Francisco,  CA,  Dec.  2003,  pp.l511- 
1515. 

[8]  L.  Eschenauer,  V.  D.  Gligor,  and  J.  Baras,  "On  Trust  Establishment 
in  Mobile  Ad  Hoc  Networks,"  Proc.  10th  Int'l  Security  Protocols 
Workshop,  Cambridge,  U.K.,  Apr.  2002,  vol.  2845,  pp.  47-66. 

[9]  J.  S.  Baras  and  T.  Jiang,  "Managing  Trust  in  Self-Organized  Mobile 
Ad  Hoc  Networks,"  Proc.  12th  Annual  Network  and  Distributed  System 
Security  Symposium  Workshop,  Feb.  2005,  San  Diego,  CA. 
[10]  S.  Ruhomaa  and  L.  Kutvonen,  "Trust  Management  Survey,"  P.  Her- 
rmann et  al.  (Eds.),  iTrust  2005,  Lecture  Notes  in  Computer  Science, 
vol.  3477,  pp.  77-92. 


205 


TABLE  I 


Authors,  Year, 
Ref.  no 

Purpose 
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ered 
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metrics 

Trust  property 

Trust  management  model 
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racy 
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Trust  revocation  introducing 
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Underlying  routing  protocol: 
DSR 
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[68] 
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malicious  packet 

forwarding 

r»o<i; 
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sitivity 
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An  extension  of  DSR  using 
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altruism  and  utilitarianism 

Buchegger 

&  Boudec 
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Reputation 
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defection  (e.g., 
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Utility^ 
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behaving nodes 
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nonergodic  fluctuations.  We  argue  that  this  is  a  general  property  of  criticality,  thereby  opening  the  door  to 
the  application  of  the  recently  discovered  phenomenon  of  complexity  matching:  For  an  efficient  transfer  of 
information  to  occur,  a  perturbing  complex  network  must  share  the  same  temporal  complexity  as  the  perturbed 
complex  network. 
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I.  INTRODUCTION 

Phase  transitions  and  critical  phenomena  occur  frequently 
in  nature  and  have  been  widely  studied  by  physicists,  see 
for  instance  [1].  The  Ising  model  [2]  originally  introduced  to 
explain  ferromagnetic  phase  transition  is  well  known,  and  the 
exact  solution  found  by  Onsager  [3]  for  the  occurrence  of  phase 
transition  in  the  two-dimensional  case  is  widely  recognized  as 
an  example  of  outstanding  theoretical  achievement.  In  the  last 
few  years  some  scientists  have  used  the  Ising  model  to  shed 
light  on  biological  and  neurophysiological  processes  [4-7]. 
More  precisely,  the  authors  of  [4]  used  the  Ising  model 
to  explain  the  collective  behavior  of  biological  networks 
and  the  authors  of  [5-7]  adopted  the  Ising  model  for  the 
purpose  of  supporting  their  hypothesis  that  the  brain  works 
at  criticality  without  establishing  a  clear  distinction  between 
phase  transition  and  self-organized  criticality  [8].  Finally, 
we  have  to  mention  that  the  Ising  model  is  frequently  used,  see 
for  instance  [9,10],  to  model  neurophysiological  data  with  the 
constraint  of  maximal  entropy.  Although  relevant  to  explore 
the  cognition  properties  of  the  brain,  this  is  a  perspective 
different  from  the  one  adopted  in  this  paper. 

Couzin  [11]  made  the  conjecture  that  the  cooperative 
interaction  between  the  birds  of  a  flock  may  mimic  the 
brain  cognition  properties,  and  the  experimental  observation 
of  Cavagna  et  al.  [12]  confirmed  the  criticality  condition  of 
a  coordinated  flock  of  birds.  It  is  important  to  emphasize  that 
the  connection  between  phase  transition  processes  and  coop- 
erative behavior  of  biological  systems  was  already  stressed  in 
the  pioneering  paper  of  Vicsek  et  al.  [13].  It  is  thought  [4]  that 
biological  systems  in  general  operate  at  criticality. 

The  phase  transition  condition  is  not  confined  to  biological 
systems.  The  recent  work  on  econophysics  [14,15]  suggests 
that  the  same  arguments  can  be  applied  to  sociological 
systems.  As  a  matter  of  fact,  Somette  [15]  proposes  a  model 
of  cooperative  economical  interaction,  of  the  same  type  as 
the  Ising  model,  which  generates  at  criticality  a  transition 
from  subcritical  to  supercritical  patterns  very  similar  to  those 
produced  by  the  Ising  model.  As  stressed  by  the  authors 
of  [5],  the  Ising  patterns  at  criticality  correspond  to  the 
emergence  of  correlation  links  yielding  a  scale-free  network 
statistically  indistinguishable  from  that  experimentally  ob- 
served within  the  brain,  using  functional  magnetic  resonance 
imaging. 


These  studies  emphasize  the  spatial  and  network  com- 
plexity emerging  from  the  cooperative  interaction  of  the 
network's  units,  but  overlook  the  temporal  complexity  of 
these  networks.  Herein  we  are  attempting  to  fill  this  gap 
and  prove  that  the  temporal  complexity  emerges  at  criticality. 
Temporal  complexity  is  defined  as  follows.  The  time  dynamics 
of  complex  networks  is  characterized  by  the  occurrence  of 
significant  events,  which  may  be  financial  crashes  [15],  brain 
quakes  [5,6],  or  the  changes  of  direction  of  a  flock  of  birds 
[12,13].  The  time  interval  between  two  consecutive  events 
is  given  by  a  distribution  density  if{t),  which,  drastically 
departing  from  the  conventional  Poisson  statistics,  has  the 
inverse-power-law  form 

Hr)<x\^  (1) 

with  /X  <  2.  The  occurrence  of  an  event  does  not  have  any 
memory  of  the  occurrence  of  earlier  events.  This  property  is 
usually  denoted  as  renewal,  but  it  must  not  be  confused  with  the 
ordinary  Poisson  and  Markov  condition:  The  signal  generated 
by  these  events  is  characterized  by  long-range  correlation  in 
time,  and,  most  importantly,  it  is  essentially  nonstationary, 
thereby  breaking  the  ergodicity  that  is  a  fundamental  property 
of  statistical  physics  [16].  We  refer  to  it  as  the  non-Poisson 
renewal  condition.  A  significant  event  is  interpreted  as  a  failure 
whose  occurrence  brings  the  network  back  to  a  brand  new 
condition.  We  refer  to  these  as  crucial  events. 

Events  of  this  kind  may  be  very  difficult  to  detect.  The 
theoretical  discussion  [16],  for  instance,  applies  to  the  case  of 
dichotomous  signals,  where  crucial  events  correspond  to  the 
abrupt  transition  from  one  to  the  other  value  of  the  signal.  In 
the  case  of  the  decision  making  model  (DMM)  under  study,  the 
single  units  in  isolation  produce  perfect  dichotomous  signals. 
They  can  be  easily  detected,  but  they  are  of  a  Poisson  kind, 
thereby  strikingly  departing  from  the  condition  of  temporal 
complexity.  When  the  control  parameter  is  extremely  large,  the 
global  signal  undergoes  abrupt  transitions  that  however  depart 
again  from  the  condition  of  temporal  complexity,  because 
they  are  predominantly  of  a  Poisson  kind.  At  criticality  the 
global  signal  loses  these  abrupt  transitions  that  are  replaced 
by  smoother  fluctuations  around  the  mean  value.  Herein  we 
consider  a  model  generating  a  signal  whose  time  mean  value 
vanishes.  We  assume  the  crossing  of  the  origin  to  be  the 
proxy  of  a  significant  event.  Thus,  we  interpret  the  signal  as  a 
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dichotomous  fluctuation  making  abrupt  changes  from  1  to  —  1 
and  back  again.  We  evaluate  the  correlation  function  of  this 
ideal  dichotomous  signal,  denoted  by  the  symbol  ^(t,t^),  and 
we  prove  that  its  dependence  on  the  times  t  and  exactly  fits 
the  prediction  of  the  non-Poisson  renewal  condition. 

The  connection  between  phase  transition,  and  its  spatial 
and  network  complexity,  with  temporal  complexity  is  not  well 
known.  In  the  literature  there  are  only  a  few  papers  where  this 
connection  is  discussed.  One  is  [17],  which  converts  the  3D 
Ising  model  into  a  critical  map  generating  type  I  intermittent 
dynamics  and  consequently  the  same  temporal  complexity 
as  the  one  under  discussion  herein.  The  emergence  of  non- 
Poisson  renewal  properties  was  also  discussed  in  [18,19], 
which  is  devoted  to  illustrating  the  cooperative  properties  of 
a  DMM.  The  foundation  of  these  papers  is  sociological  and 
similar  to  that  of  the  economical  model  illustrated  in  the  work 
of  Sornette  [15].  Thus,  the  authors  [18,19]  did  not  discuss  the 
connection  of  the  DMM  with  the  Ising  model  and  adopted  the 
nonrealistic  but  often  made  assumption  of  all-to-all  coupling 
among  the  network's  units,  thereby  confining  the  emergence 
of  an  inverse  power  law,  namely,  of  temporal  complexity,  to 
the  same  time  scale  as  that  of  the  units  in  isolation. 

The  motivation  for  the  present  research  work  is  given  by 
the  transfer  of  information  from  a  complex  network  to  another 
complex  network  [20].  Rather  than  using  entropic  arguments, 
as  is  usually  done  with  the  transfer  of  information,  we  open 
the  door  to  the  concept  of  complexity  management  recently 
advocated  in  [21].  This  latter  work  established  that  a  complex 
network  driven  by  crucial  events  is  sensitive  to  stimuli  with  the 
same  complexity.  In  other  words,  the  transfer  of  information 
rests  on  temporal  complexity  and  this  explains  the  motivation 
to  prove  that  criticality  also  generates  temporal  complexity. 

In  Sec.  II  we  discuss  under  which  conditions  the  DMM  [18, 
19]  is  equivalent  to  the  Ising  model.  In  Sec.  Ill  we  prove  that 
at  criticality  the  function  \l/(t)  becomes  an  inverse  power  law 
for  over  four  decades.  We  shall  also  argue  that  with  increasing 
the  network's  size  \l/(t)  may  become  an  infinitely  extended 
inverse  power  law.  Section  IV  shows  that  this  is  a  renewal 
process.  Finally  we  devote  Sec.  V  to  concluding  remarks. 

II.  DETAILED  DESCRIPTION  OF  THE  MODEL 

We  consider  a  system  of  L  discrete  variables  located  at  the 
nodes  of  a  two-dimensional  square  lattice.  Each  unit  st  is  a 
stochastic  oscillator  and  can  be  found  in  either  of  two  states, 
+  1  or  —1.  The  dynamic  is  introduced  by  choosing  a  single 
unit  on  site  /  and  updating  it  in  an  elementary  time  step  with  a 
transition  rate  g: 
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Here  M  denotes  the  total  number  of  nearest  neighbors,  and 
M+i  and  M_i  the  number  of  nearest  neighbors  being  in  the 
state  +1  and  —1,  respectively.  Single  units  change  its  states, 
thereby  making  M+i  and  M_i  fluctuate  in  time,  while,  of 
course,  the  total  number  of  nearest  neighbors  is  conserved, 
M+i  +  M_i  =  M. 
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FIG.  1 .  (a)  Temporal  evolution  of  a  single  unit  Si  (t)  and  (b)  of  the 
global  order  parameter  ^(t)  for  the  decision  making  model  realized  on 
a  square  lattice  of  L  =  50  x  50  nodes,  with  go  =  0.01  and  K  =  \  .70. 
To  illustrate  the  concept  of  crucial  events  we  mark  the  time  intervals 
r  between  two  consecutive  events,  according  to  their  definitions 
assumed  in  this  paper.  Notice  different  time  scales  on  both  plots. 

All  numerical  calculation  are  performed  on  a  square  lattice 
of  either  L  =  50  x  50  or  L  =  100  x  100  nodes  with  periodic 
boundary  conditions.  In  a  single  time  step  a  run  over  the  whole 
lattice  is  performed  and  for  every  unit  Si  the  transition  rate  of 
Eqs.  (2)  or  (3)  is  calculated  according  to  which  a  node  is  given 
the  possibility  to  change  its  state.  The  single  unit  in  isolation, 
K  =  0,  fluctuates  between  states  + 1  and  —  1  with  the  transition 
rate  ^  =  ^o-  When  coupling  constant  K  >  0,3.  unit  in  the  state 
+  1  (—1)  makes  a  transition  to  the  state  —1  (+1)  faster  or 
slower  according  to  whether  M_i  >  M+i  (M+i  >  M_i)  or 
M_i  <  M+i  (M+i  <  M_i),  respectively. 

Next,  we  define  the  global  order  parameter  ^(t)  = 
■lY^f=iSi(t),  which  is  characterized  by  the  variability  that 
does  not  possess  the  dichotomous  character  of  single  units.  In 
Fig.  1  we  show  exemplary  temporal  evolution  for  the  single 
unit  Si(t)  and  for  the  global  order  parameter  ^(t).  Note  that 
the  amplitude  of  variable  ^(t)  depends  on  the  value  of  the 
coupling  constant  K.  When  K  =  0,  all  units  in  the  system  are 
independent  Poisson  processes;  thereby  an  average  taken  at 
any  moment  of  time  over  all  of  them  is  zero.  Once  the  value 
of  the  coupling  increases,  K  >  0,  single  units  are  less  and 
less  independent,  resulting  in  a  nonzero  average.  The  quantity 
Kc  is  the  critical  value  of  the  control  parameter  K,  at  which 
point  a  phase  transition  to  a  global  majority  state  occurs.  In 
numerical  calculations  we  use  the  time  average  ^eq  =  (l§(OI> 
as  a  measure  of  this  global  majority.  More  precisely,  after  initial 
10^  time  steps,  the  average  is  taken  over  the  same  number  of 
the  consecutive  time  steps  of  the  model. 

Note  that  in  the  special  case  when  M  is  the  same  for  all  the 
nodes  and  ^  1,  we  find  that  our  model  generates  the  same 
phase  transition  as  the  two-dimensional  Ising  model  discussed 
in  the  seminal  paper  of  Onsager  [3].  This  is  an  expected  result 
insofar  as  the  Ising  model  rests  on  the  Hamiltonian 
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where  a/  and  aj  denote  the  PauH  operators  with  eigenvalues 
lb  1 .  According  to  Boltzmann  statistics  equilibrium  is  described 
by  the  density  matrix 


1.00 


P  = 


(5) 


with  P  =  kB  being  the  Boltzmann  constant  and  T  the 
absolute  temperature.  Thus,  the  off-diagonal  elements  of  the 
transfer  matrix  [22]  become  equivalent  to  the  transition  rates 
of  Eqs.  (2)  and  (3),  under  the  condition 


/  M 
keT  =  —  =  — . 
K  K 


(6) 


As  examples  of  conditions  yielding  this  equivalence,  we 
consider  two  cases.  The  former  is  the  case  of  all-to-all 
coupling,  where  M  =  L.  The  latter  is  considered  here,  and 
it  is  the  two-dimensional  lattice  where  each  node  is  coupled  to 
its  4  nearest  neighbors,  thereby  setting  M  =  4. 

The  thermodynamical  condition  M  =  L  =  oo  was  dis- 
cussed extensively  by  authors  of  [18,19],  who  showed  that 
under  those  conditions  the  ratios  ^  are  equivalent  to  the 
probabilities  p±i  for  a  node  Si  to  be  in  one  of  two  allowed 
states.  The  dynamic  evolution  of  a  single  unit  state  is  then 
described  by  a  two- state  master  equation 
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Solving  the  above  for  the  difference  in  probabilities  Y\{t)  = 
p^iit)  —  p-i{t),  which  corresponds  to  the  earlier  defined 
global  order  parameter  § (t),  one  obtains 

d 

—  n  =  2^o[sinh(/^n)  -  O  cosh(/^n)].  (9) 
dt 

This  equation  yields  two  solutions,  corresponding  to  global 
majority  states,  for  the  values  of  coupling  constant  K  >  Kc, 
where  Kc  =  1. 

The  solution  to  the  latter  condition  of  M  =  4,  L  =  oo  can  be 
found  in  [23]  and  yields  the  condition  for  the  global  variable  to 
be  ^eq  =  (1  -  [sinh(/^/2)]"4)i/8  this  case  the  critical  value 
of  the  coupling  constant  is  Kc  =  21n(l  +  a/2)  =  1.7627.  In 
Fig.  2,  we  show  the  phase  transition  in  the  latter  condition  and 
we  find  that,  as  expected,  the  numerical  evaluation  of  ^^q  (K)  is 
very  close  to  the  theoretical  prediction  of  Onsager  [3],  thereby 
confirming  that  the  DMM  is  equivalent  to  the  Ising  model  in 
the  limiting  case  go  ^  0. 

This  equivalence  between  the  DMM  and  the  Ising  model  is 
merely  formal,  because  the  DMM  does  not  have  a  Hamiltonian 
origin  and  does  not  require  the  action  of  a  thermal  bath  at 
temperature  T  to  work  as  does  the  Boltzmann  picture.  This 
explains  why  the  equivalence  with  the  Ising  model  requires 
that  ^0  vanish,  so  as  to  freeze  the  dynamics  of  the  single  units, 
in  the  absence  of  cooperation. 

When  we  release  the  condition  go  0,  the  equivalence  of 
the  DMM  with  the  Ising  model  is  lost.  In  Fig.  3,  we  see  that, 
if  the  condition  of  ^  1  is  abolished,  the  phase  transition 
emerges  at  values  of  K  lower  than  Kc  of  the  theoretical 
prediction.  Simultaneously,  further  increase  of  go  (go  ~  0.40) 
leads  to  a  new  regime,  in  which  every  unit  is  surrounded  by 
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FIG.  2.  The  phase  diagram  for  global  variable  ^eq-  Thin  solid  line 
and  dashed  line  are  the  theoretical  predictions  for  the  fully  connected 
and  two-dimensional  regular  network,  respectively.  In  both  cases 
L  =  oo  and  the  latter  case  is  the  Onsager  theoretical  prediction  [3] 
for  2D  regular  lattice.  The  thick  solid  line  corresponds  to  the  global 
states  observed  for  two-dimensional  regular  lattice  (L  =  100  x  100 
nodes)  and     =  0.01.  Periodic  boundary  conditions  were  applied. 

nearest  neighbors  in  the  opposite  state,  yielding  an  update  of 
its  state  at  every  time  step  and  generating  the  condition  in 
which  the  order  parameter  ^(t)  is  exactly  zero  at  all  times.  It 
is  worth  pointing  out  that  when  a  network  characterized  by 
a  large  transition  rate  g  (transition  rate  ^  ~  1)  is  studied,  it 
shows  sensitivity  to  the  initial  random  configuration  of  the 
lattice.  We  observed  that  in  a  limited  number  of  cases  the 
order  parameter  evolves  to  a  global  majority  state,  in  which 
all  the  nodes  are  in  the  same  state  rather  than  in  condition 
^(t)  =  ^eq  =  0.  Therefore  one  may  consider  a  bifurcation  in 
the  phase  space  of  the  model  parameters  in  this  regime. 

III.  ORDER  PARAMETER  REVERSAL  TIMES 

Let  us  now  study  the  temporal  complexity  of  the  order 
parameter  ^{t).  As  pointed  out  in  Sec.  I,  to  prove  temporal 
complexity  of  ^(t)  we  have  to  observe  significant  events 
and  prove  that  they  are  crucial.  At  criticality  the  signal  is 
not  dichotomous  and,  as  we  shall  see  hereby,  there  are  good 
reasons  why  it  must  depart  from  the  dichotomous  condition. 


0.50 
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FIG.  3.  The  two-dimensional  phase  diagram  of  global  variable 
^eq  evaluated  for  a  range  of  model  parameters,  and  ^,  on  a  two- 
dimensional  regular  lattice  of  size  L  =  100  x  100  nodes. 
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FIG.  4.  Survival  probability  function  ^(r)  for  the  order  parame- 
ter f  (t)  evaluated  on  a  two-dimensional  lattice  of  size  L  =  50  x  50 
for  go  =  0.01  and  increasing  values  of  coupling  constant  K.  The 
straight  line  corresponds  to  the  slope  of  —0.50,  namely  to  /x  =  1.50, 
since  ^(r) 


FIG.  5.  The  survival  probability  function  ^(t)  of  the  global  order 
parameter  ^(t)  is  compared  with  the  transitions  between  two  states 
for  a  single  unit  Si.  Simulations  were  performed  on  a  lattice  of 
size  L  =  50  X  50  for  =  0-01  and  increasing  values  of  coupling 
constant  K. 


We  make  the  conjecture  that  the  crossing  of  the  origin,  namely 
the  times  at  which  ^(t)  changes  sign,  are  the  significant  events 
to  observe.  As  illustrated  on  Fig.  1,  we  interpret  the  time 
interval  r  between  two  consecutive  crossings  as  the  time 
duration  of  a  given  decision,  even  if  this  decision  may  rest 
on  a  slight  and  fluctuating  majority. 

We  evaluate  the  distribution  density  of  decision-time  dura- 
tions r ,  V^(r),  and  the  corresponding  survival  probability  ^(r), 
where  ^(t)  =  JrV^(r).  Although  emerging  from  a  simple 
regular  lattice,  that  is,  one  with  no  structural  complexity, 
the  survival  probability  presented  in  Fig.  4  shows  a  scale-free 
property  that  extends  over  more  than  four  decades  in  time 
for  K  ^  Kc.  A  further  increase  of  K  does  not  affect  the 
power-law  region  and  has  the  effect  of  producing  a  more  and 
more  extended  exponential  shoulder.  The  exponential  shoulder 
is  expected  to  become  predominant  fovK  oo. 

As  mentioned  earlier,  a  single  unit  in  isolation  fluctuates 
between  two  states  with  the  transition  rate  ^  =  ^o-  The 
corresponding  survival  probability  function  is  an  exponential 
function  ^(r)  =  exp(—goT).  At  the  same  time,  as  illustrated 
on  Fig.  5,  a  coupled  unit  tends  to  update  its  state  with  a 
transition  rate  smaller  than  ^o-  This  is  a  property  of  criticality 
that  is  lost  completely  for  very  large  values  of  the  control 
parameter  K.  Although  it  is  a  computational  challenge  to 
explore  the  dynamics  corresponding  to  extremely  large  values 
of  K,  our  numerical  results  suggest  that  when  K  is  very  close  to 
the  critical  value,  and  a  decision  is  reached  by  a  slight  majority, 
the  single  units  have  a  dynamics  almost  indistinguishable 
from  the  Poisson  dynamic  that  they  would  have  in  isolation. 
As  we  increase  the  control  parameter  K  and  the  majority 
becomes  larger  the  single  units  keep  their  Poisson  dynamics 
with  a  smaller  rate.  Although  it  is  not  possible  because  of 
computer- time  limitations  to  study  the  network's  dynamics 
for  values  of  K  much  larger  than  K  =  1.80,  it  is  plausible  to 
make  the  conjecture  that  the  single  units  maintain  their  Poisson 
dynamics  and  that  these  dynamics  become  closer  and  closer  to 
that  of  the  global  variable  that  will  lose  its  power-law  structure 
and  will  become  predominantly  exponential.  In  other  words, 
unanimous  and  permanent  consensus  must  be  perceived  as  the 


manifestation  of  a  limiting  condition  of  Poisson  but  infinitely 
slow  dynamics,  where  the  behavior  of  the  single  units  becomes 
identical  to  that  of  the  global  signal. 

To  make  this  argument  more  compelling  let  us  discuss  the 
properties  of  the  observed  exponential  shoulder.  Since  one 
observes  the  network  in  an  organized  phase  (K  >  Kc)  it  is 
not  surprising  that  one  perceives  the  exponential  signature  of 
an  equilibrium  regime.  Therefore  we  assume  that  under  those 
conditions  the  global  variable  ^{t)  follows  the  dynamics  of  a 
particle  diffusing  in  a  double  potential  well.  In  the  equilibrium 
condition  one  would  expect  the  survival  time  of  staying  in 
one  well  to  be  ^(r)  =  exp(— (2r),  where  a  =  A  exp(— 2/Z)). 
The  parameter  Q  denotes  the  height  of  the  potential  barrier 
separating  the  wells,  and  D  stands  for  the  diffusion  coefficient. 
Following  [19]  we  expect  the  barrier  to  be  a  function  of 
the  coupling  constant  K  and  the  diffusion  to  depend  on  the 
number  of  nodes  L.  By  fitting  the  shoulder,  once  for  a  case 
where  L  is  kept  constant  and  K  varies,  and  secondly  for  an 
opposite  condition,  we  assess  the  above  hypothesis.  The  fitting 
procedure  revealed  that  the  barrier  height  2  is  a  linear  function 
ofK  {Q  ~  K)  and  that  the  diffusion  coefficient  D  is  inversely 
proportional  to  the  system  size  L  {D  ~  1/L).  For  L  ^  oo  the 
transition  rate  becomes  infinitely  small  and  the  exponential 
shoulder  becomes  predominant  and  virtually  coincident  with 
the  Poisson  dynamics  of  the  single  units. 

While  in  the  organized  phase  the  effect  of  increasing  L 
is  that  of  making  the  process  exponential,  although  with  a 
virtually  infinite  transition  time.  At  criticality  the  increase  of 
L  has  a  dramatically  different  effect.  The  authors  of  [19], 
based  on  this  DMM  in  the  all-to-all  condition,  found  that  at 
criticality  an  inverse-power-law  behavior  emerges,  which  is 
confined,  however,  to  a  time  region  with  the  same  size  as  the 
Poisson  time  Tp  ~  l/^o-  However,  this  is  not  the  case  when 
the  units  are  the  nodes  of  a  regular  two-dimensional  lattice, 
interacting  only  with  the  four  nearest-neighbor  nodes.  We 
have  evaluated  the  time  size  of  the  inverse-power-law  region 
appearing  before  the  exponential  shoulder  and  we  found  that 
its  size  tends  to  increase  with  increasing  L  as  ~  L~^-^^. 
Unfortunately,  this  observation  is  limited  to  values  of  L 
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smaller  than  L  =  10000,  since  going  beyond  would  exceed 
the  limits  of  our  computer  facilities.  However,  on  the  basis  of 
this  observation  we  reach  the  conclusion  that  for  L  ^  oo  the 
size  of  the  temporal  complexity  must  become  infinitely  large. 
Thus,  criticality  is  a  kind  of  complex  singularity  embedded  in  a 
Poisson  sea. 

These  intuitive  arguments  also  explain  why  the  signal  ^{t) 
at  criticality  must  depart  from  the  dichotomous  condition. 
This  is  a  consequence  of  the  fact  that  the  single  units  are 
not  rigidly  bound  to  follow  the  opinion  of  the  majority. 
There  is,  consequently,  a  subtle  connection  between  criticality, 
free  will,  and  correlation  between  different  units.  Although 
the  single  units  may  change  opinion,  they  cannot  do  that  in  a 
way  totally  uncorrelated  from  the  behavior  of  the  other  units, 
insofar  as  this  would  be  incompatible  with  the  emergence  of  a 
majority,  as  slight  as  it  might  be. 

IV.  TESTING  RENEWAL  PROPERTIES 

As  pointed  out  earlier,  the  signal  ^{t)  at  criticality  is  not 
dichotomous.  However,  replacing  it  with  a  dichotomous  signal 
corresponding  to  1  (yes)  when  ^{t)  >  0  and  to  —1  (no)  when 
^(t)  <  0,  thereby  turning  it  into  an  ideal  dichotomous  signal, 
is  the  most  convenient  way  to  reveal  the  emergence  of  crucial 
events  at  criticality.  The  theory  [24]  (see  also  the  earlier 
work  [25])  allows  us  to  evaluate  the  nonstationary  correlation 
function  ^(t,t^)  in  the  specific  case  when  the  renewal  process 
rests  on  the  waiting  time  distribution  density  \l/(t)  assumed  to 
be  an  ideal  inverse  power  law.  In  general,  regardless  of  whether 
this  ideal  condition  of  infinitely  extended  inverse  power  law  is 
realized  or  not,  it  is  possible  to  establish  whether  the  process 
is  renewal  by  noticing  that 


(10) 


where  x  =  t  —  t'  and  ta  =  t\  As  explained  [26],  the  function 
^^(TJa)  can  be  evaluated  by  using  the  time  series  {ti},  where  ti 
are  the  times  at  which  the  fluctuation  ^(t)  crosses  the  origin.  We 
use  a  mobile  window  of  length  ta  to  evaluate  the  waiting  time 
distribution  \l/(T,ta)  and  the  corresponding  survival  probability 
^f(T,ta),  by  locating  the  origin  of  the  window  on  the  time 
of  occurrence  of  an  event  and  measuring  the  time  distance 
between  the  end  of  the  window  and  the  time  of  occurrence  of 
the  first  event  after  the  window  end.  To  establish  whether 
the  process  is  renewal  or  not  we  shuffle  the  sequence  of 
reversal  times,  thereby  generating  a  new  time  series  {t-}. 
We  use  the  same  procedure  based  on  a  mobile  window  to 
determine  ^(^a,^,r),  which  corresponds  to  the  nonstationary 
correlation  function  in  the  renewal  case.  If  the  two  survival 
probabilities  coincide,  we  conclude  that  the  process  is  renewal. 
If  the  renewal  process  is  exponential,  the  correlation  function 
is  stationary,  and  there  is  no  aging. 

In  the  ideal  case  of  an  infinitely  extended  inverse  power  law, 
with  power  index  /x  <  2,  as  discussed  in  earlier  work  [16,24], 
aging  is  perennial.  In  the  case  studied  in  this  paper,  we  have 
to  take  into  account  that  this  ideal  condition  would  be  realized 
by  assigning  an  infinitely  large  value  to  L,  with  the  ensuing 
consequence  that  temporal  complexity  would  become  virtually 
invisible,  due  to  the  joint  effect  of  an  extremely  slight  majority 
and  of  a  very  extended  sojourn  in  a  given  decision  state.  To 
make  temporal  complexity  visible,  we  need  a  compromise. 
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FIG.  6.  Testing  for  renewal  property.  Survival  probability  func- 
tion ^(r)  of  the  order  parameter  ^(t)  evaluated  on  lattice  of  size 
L  =  50  X  50  and  =  0.01,  K  =  1.10  was  aged  respectively  by 
time  ta  =  100,  ta  =  1000,  and  ta  =  10000.  Those  aged  survival 
probabilities  ^(ta,T)  are  compared  with  the  aged  renewal  prediction 

'^(ta,s,T). 

and  we  have  to  set  a  limit  on  the  time  extension  of  the 
inverse  power  law.  As  a  consequence,  we  obtain  the  very 
interesting  result  depicted  in  Fig.  6,  where  aging,  namely  the 
slowing  down  of  the  survival  probability,  becomes  very  large 
in  the  inverse-power-law  region,  with  the  survival  probability 
remaining  virtually  constant,  and  in  the  correspondence  of 
the  exponential  shoulder  it  is  virtually  suppressed,  in  accor- 
dance with  the  principle  that  Poisson  statistics  annihilates 
aging. 


V.  CONCLUSIONS 

This  paper  establishes  that  at  the  onset  of  phase  transition, 
in  addition  to  the  spatial  and  network  complexity  shown  by  a 
number  of  earlier  investigations  [4-7,9, 1 2, 14, 1 5] ,  we  also  have 
temporal  complexity.  This  significant  result  reveals  a  path  for 
the  transport  of  information  from  one  complex  network  to 
another.  As  pointed  out  in  Sec.  I,  there  is  a  general  agreement 
that  complex  systems  are  a  set  of  many  units  interacting  at 
the  onset  of  phase  transition.  The  present  analysis  proves  that 
these  complex  networks  are  characterized  also  by  temporal 
complexity,  and  consequently  a  perturbing  complex  network 
is  expected  to  exert  its  influence  on  another  complex  network 
via  the  recently  discovered  complexity  management  process 
[21,31].  Although  the  present  theoretical  predictions  are  based 
on  the  assumption  that  V^(r)  is  an  ideal  inverse  power  law  and 
the  real  complex  networks,  as  shown  herein,  reach  this  ideal 
condition  only  when  their  size  is  infinitely  large,  the  result 
in  Fig.  5  indicates  that  the  time  region  generating  ergodicity 
breakdown  may  become  so  extended  as  to  make  a  complex 
network  virtually  insensitive  to  stimuli  that  do  not  share  the 
same  extended  nonergodic  condition. 

In  the  last  few  years  the  attention  of  investigators  has 
been  moving  from  chaos  synchronization  of  two  nonlinear 
oscillators  [27]  to  the  synchronization  dynamics  of  many  units 
in  large-scale  networks  [28],  called  inner  synchronization  [29] 
for  convenience.  The  cooperation  induced  phase  transition 
discussed  herein  can  to  some  extent  be  thought  of  as  a  form  of 
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inner  synchronization.  As  pointed  out  [29]  the  challenge  that 
the  researchers  in  the  field  of  complexity  have  now  to  address 
is  the  phenomenon  of  outer  synchronization  [30]. 

In  this  light,  we  conclude  that  the  results  presented  here 
provide  a  way  of  addressing  outer  synchronization,  redirecting 
the  attention  of  researchers  from  the  details  of  the  topology 
of  the  complex  networks  to  the  analysis  of  the  temporal 
complexity  that  inner  synchronization  may  generate.  In  fact  the 


principle  of  complex  management  [21,31]  rests  on  temporal 
complexity,  regardless  of  the  way  it  is  generated. 
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Abstract — A  channel-coded  physical-layer  network  coding 
strategy  is  refined  for  practical  operation.  The  system  uses 
frequency-shift  keying  (FSK)  modulation  and  operates  nonco- 
herently,  providing  advantages  over  coherent  operation:  there 
are  no  requirements  for  perfect  power  control,  phase  synchro- 
nism, or  estimates  of  carrier-phase  offset.  In  contrast  with 
analog  network  coding,  which  relays  received  analog  signals  plus 
noise,  the  system  relays  digital  network  codewords,  obtained  by 
digital  demodulation  and  channel  decoding  at  the  relay.  The 
emphasis  of  this  paper  is  on  the  relay  receiver  formulation. 
Closed-form  expressions  are  derived  that  provide  bitwise  log- 
likelihood  ratios,  which  may  be  passed  through  a  standard 
error-correction  decoder.  The  role  of  fading-amplitude  estimates 
is  investigated,  and  an  effective  fading-amplitude  estimator  is 
developed.  Simulation  results  are  presented  for  a  Rayleigh  block- 
fading  channel,  and  the  influence  of  block  length  is  explored. 
An  example  realization  of  the  proposed  system  demonstrates  a 
32.4%  throughput  improvement  compared  to  a  similar  system 
that  performs  network  coding  at  the  link  layer.  By  properly 
selecting  the  rates  of  the  channel  codes,  this  benefit  may  be 
achieved  without  requiring  an  increase  in  transmit  power. 

Index  Terms — Network  coding,  two-way  relay  channel, 
frequency-shift  keying,  noncoherent  reception,  channel  estima- 
tion. 

I.  Introduction 

IN  the  two-way  relay  channel  (TWRC),  a  pair  of  source 
terminals  exchange  information  through  an  intermediate 
relay  without  a  direct  link  between  the  sources  [1].  The 
exchange  can  occur  in  two,  three,  or  four  orthogonal  time 
slots,  depending  on  how  the  information  is  encoded  [2].  With 
a  traditional  transmission  scheduling  scheme,  the  exchange 
requires  four  slots.  In  each  of  the  first  two  slots,  one  of  the 
terminals  transmits  a  packet  to  the  relay,  while  in  each  of 
the  last  two  slots,  the  relay  transmits  a  packet  to  each  of  the 
terminals.  By  using  network  coding  [3],  the  number  of  slots 
can  be  reduced.  With  link-layer  network  coding  (LNC),  the 
third  and  fourth  slots  are  combined  into  one  slot  by  having 
the  relay  add  (modulo-2)  the  packets  that  it  receives  from  the 
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Fig.  1.  (a)  Link-layer  network  coding,  and  (b)  Physical-layer  network  coding. 

two  terminals.  During  the  third  step,  the  relay  sends  the  sum 
of  the  two  packets,  and  each  terminal  is  able  to  recover  the 
information  from  the  other  terminal  by  subtracting  (or  adding, 
modulo-2)  its  own  packet  from  the  received  signal.  With 
physical-layer  network  coding  (PNC),  the  first  two  slots  are 
combined  by  having  the  two  terminals  transmit  their  packets  at 
the  same  time  [2] .  The  relay  receives  a  combination  of  both 
modulated  packets  during  the  first  slot,  which  it  broadcasts 
(after  appropriate  processing)  to  the  two  terminals  during  the 
second  slot.  PNC-based  strategies  capable  of  supporting  more 
than  just  two  source  terminals  over  the  TWRC  may  be  found 
in  [4]. 

The  transmission  schedules  for  LNC  and  PNC  are  illustrated 
in  Fig.  1.  The  source  terminals  A/i  and  A/2  transmit  messages 
ui  and  U2,  respectively,  where  each  message  is  a  packet 
containing  many  information  bits.  The  messages  are  (channel) 
encoded  and  modulated  by  the  function  r5'(-).  In  the  case  of 
LNC,  the  two  messages  are  sent  in  orthogonal  time  slots,  while 
in  the  case  of  PNC,  they  are  sent  to  the  relay  at  the  same  time 
over  a  multiple-access  channel  (MAC).  For  both  LNC  and 
PNC,  the  relay  broadcasts  the  encoded  and  modulated  signal 
ri^(u)  in  the  final  time  slot,  where  u  is  the  network  codeword 
and  Tr{-)  is  the  function  used  by  the  relay  to  encode  and 
modulate  the  network  codeword.  Using  the  received  version 
of  ri^(u)  and  knowledge  of  its  own  message,  each  terminal 
is  able  to  estimate  the  message  sent  by  the  other  terminal. 

There  are  several  options  for  implementing  PNC.  The  relay 
may  simply  amplify  and  forward  the  signal  received  from  the 
end  nodes,  without  performing  demodulation  and  decoding. 
This  PNC  scheme  is  referred  to  as  analog  network  coding 
(ANC)  in  [5]  and  PNC  over  an  infinite  field  (PNCI)  in  [6]. 
Another  option  is  for  the  relay  to  perform  demodulation  and 
decoding  in  an  effort  to  estimate  the  network  codeword,  which 
is  remodulated  and  broadcast  to  the  terminals.  This  scheme 
is  simply  called  PNC  in  [2]  and  PNC  over  a  finite  field 
(PNCF)  in  [6],  but  in  this  paper  we  refer  to  it  as  digital 
network  coding  (DNC)  to  distinguish  it  from  ANC.  Under 
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many  channel  conditions,  DNC  offers  enhanced  performance 
over  ANC.  This  is  because  the  decoding  operation  at  the  relay 
helps  DNC  to  remove  noise  from  the  MAC  phase,  while  the 
noise  is  amplified  by  the  relay  when  ANC  is  used.  However, 
ANC  avoids  the  computational  complexity  of  demodulation 
and  decoding  at  the  relay. 

Symbol  timing  is  a  critical  consideration  in  systems  em- 
ploying PNC.  Synchronization  of  the  clocks  and  packet  trans- 
missions at  the  two  source  nodes  can  be  achieved  by  network 
timing  updates.  These  updates  are  routine  in  networks  with 
scheduling  mechanisms,  such  as  cellular  networks.  When  the 
propagation  times  of  the  signals  from  the  sources  differ,  the 
symbols  arrive  at  the  relay  misaligned.  The  timing  offset  is 
r  =  A^/c,  where  c  is  the  speed  of  light,  and  A^^  is  the 
difference  in  link  distances  from  the  sources  to  the  relay.  For 
insignificant  delay,  we  need  r  <<  T5/2,  where  T5  denotes 
the  symbol  period.  This  constraint  limits  the  symbol  rate.  As 
an  example,  assume  A^^  =  300  meters.  Then,  T5  >>  2  /is  is 
required,  and  the  symbol  rate  is  limited  to  250  kilosymbols/s. 
An  alternative  is  to  delay  the  transmission  of  the  node  closer 
to  the  relay  by  r.  However,  this  requires  tracking  the  distances 
between  the  sources  and  the  relay. 

A  common  assumption  made  in  the  PNC  literature  is 
that  the  signals  are  coherently  demodulated  and  that  perfect 
channel-state  information  (CSI)  is  available  at  the  receivers. 
For  instance,  decode-and-forward  relaying  has  been  consid- 
ered for  binary  phase- shift  keying  [7]  and  minimum- shift 
keying  [8]  modulations,  but  in  both  cases  the  relay  must 
perform  coherent  reception.  An  amplify-and-forward  protocol 
is  considered  in  [9],  which  allows  the  decision  to  be  deferred 
by  the  relay  to  the  end-node,  though  detection  is  still  coherent. 
When  two  signals  arrive  concurrently  at  a  common  receiver, 
neither  coherent  detection  nor  the  cophasing  of  the  two 
signals  (so  that  they  arrive  with  a  constant  phase  offset)  is 
practical.  The  latter  would  require  preambles  that  detract  from 
the  overall  throughput,  stable  phases,  and  small  frequency 
mismatches.  To  solve  this  problem,  frequency- shift  keying 
(FSK)  was  proposed  for  DNC  systems  in  [10]  and  [11]. 
A  key  benefit  of  using  FSK  modulation  is  that  it  permits 
noncoherent  reception,  which  eliminates  the  need  for  phase 
synchronization.  An  alternative  to  noncoherent  FSK  is  to  use 
differential  modulation,  which  has  been  explored  in  [12]. 

In  PNC  systems,  it  is  desirable  to  protect  the  data  with 
a  channel  code.  The  combination  of  channel  coding  and 
physical-layer  network  coding  is  considered  in  [13]  In  [11], 
we  investigate  the  use  of  a  binary  turbo  code  in  a  noncoherent 
DNC  system.  When  using  a  binary  turbo  code  in  a  DNC 
system,  the  relay  demodulator  must  be  able  to  produce  bitwise 
log-likelihood  ratios  (LLRs)  that  are  introduced  to  the  input 
of  the  channel  decoder. 

Channel  estimation  is  an  important  issue,  especially  when 
a  channel  code  is  used.  A  training-based  channel  estimation 
scheme  for  PNC  at  the  relay  assuming  amplify-and-forward 
operation  is  considered  in  [14].  The  relay  estimates  channel 
parameters  from  training  symbols  and  adapts  its  broadcast 
power  in  order  to  maximize  the  signal-to-noise  ratio  at  the  end 
nodes.  Estimation  of  both  channel  gains  in  the  two-way  relay 
channel  at  the  end  nodes,  rather  than  the  relay,  is  considered 
in  [15].  Novel  channel  estimators  are  presented  which  provide 


better  performance  than  common  techniques  such  as  least- 
square  and  linear-minimum-mean- squared  error  estimation.  In 
[16],  we  propose  a  blind  channel  estimator  for  the  relay  of  the 
noncoherent  DNC  system. 

In  this  paper,  we  investigate  receiver-design  issues  related 
to  the  use  of  noncoherent  FSK  in  DNC  systems.  While 
noncoherent  FSK  has  been  previously  proposed  for  DNC 
sytems  in  [10],  we  make  the  following  specific  contributions: 

1)  We  provide  closed-form  expressions  for  the  relay  re- 
ceiver decision  rule  with  different  types  of  CSI.  This 
is  in  contrast  with  [10],  which  resorted  to  numerical 
methods  to  solve  the  decision  rule  (see  the  comment 
below  equation  (8)  in  [10]). 

2)  We  consider  the  use  of  a  turbo  code  for  additional 
data  protection.  This  requires  that  the  relay  receiver  be 
formulated  so  that  it  produces  bitwise  LLRs,  which  may 
be  passed  through  a  standard  turbo  decoder. 

3)  We  provide  results  for  Rayleigh  block-fading  channels. 
The  results  in  [10]  were  only  for  a  phase-fading  channel. 

4)  We  propose  a  channel  estimator  that  is  capable  of 
determining  the  fading  amplitudes  of  the  channels  from 
the  two  terminals  to  the  relay.  The  estimator  does  not 
require  pilot  symbols. 

The  remainder  of  this  paper  is  organized  as  follows. 
Section  II  presents  the  system  model  used  throughout  the 
paper.  Section  III  derives  the  relay  receiver,  while  Section 
IV  discusses  channel-estimation  issues.  Section  V  provides 
simulation  results,  and  Section  VI  concludes  the  paper. 

II.  System  Model 

The  discrete-time  system  model  shown  in  Fig.  2  gives 
an  overview  of  the  processing  at  all  three  nodes.  Terminal 
Mi^i  G  {1,2},  generates  a  length-i^T  information  sequence, 
=  ^i,^]-  The  two  terminals  channel-encode 

and  modulate  their  information  sequences  using  the  function 
r5'(-),  which  is  common  to  both  nodes.  A  rate-ri  turbo  code 
is  used,  and  the  resulting  length  Ls  =  K/ri  turbo  codeword 
generated  by  Mi  is  denoted  by  =  [hi^i^  ...hi^Ls]  (i^^t  shown 
in  the  diagram).  The  signal  transmitted  by  node  J\fi  during 
signaling  interval  kTg  <  t  <  (/c  +  l)Ts  is 


COS 


2^  (  /c.  +  ^  )     -  kTs) 


(1) 


where  8i  is  the  transmit  energy,  /c-  is  the  carrier  frequency 
of  node  J\fi  (in  practice,  the  carrier  frequencies  of  the  two 
nodes  are  not  necessarily  the  same),  and  Tg  is  the  symbol  pe- 
riod. Note  that  (1)  is  continuous-phase  frequency- shift  keying 
(CPFSK)  with  a  unity  modulation  index,  which  is  orthogonal 
under  noncoherent  demodulation  and  has  a  continuous  phase 
transition  from  one  symbol  to  the  next  [17].  The  orthogonally- 
modulated  signal  Si  (t)  may  be  represented  in  discrete  time  by 
the  2  X  Ls  matrix      =  [x^^i,  ...,x^^Lg]  with  k^^  column 


IT 


if  bi^k  =  0 
if  bih  =  1. 


(2) 


For  the  DNC  system,  the  signals  are  transmitted  simulta- 
neously by  the  two  source  nodes  over  a  MAC  channel.  The 
relay  receives  the  noisy  electromagnetic  sum  of  interfered  and 
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Fig.  2.    Discrete-time  system  model. 

faded  signals,  Y,  and  applies  the  demodulation  and  channel- 
decoding  function  r^^(-).  The  demodulation  operation  yields 
a  soft  estimate  of  the  network-and-channel-coded  message 
b  =  bi  0  b2  (not  shown),  while  the  channel-decoding  op- 
eration yields  a  hard-decision  on  the  network-coded  message 
u  =  ui  0  U2.  With  the  LNC  system,  the  two  sources  transmit 
during  orthogonal  time  slots.  The  received  versions  of  Xi  and 
X2  are  demodulated  independently  to  provide  soft  estimates 
of  bi  and  b2.  These  soft  estimates  are  combined  and  turbo 
decoded  to  yield  a  hard  estimate  of  u.  The  key  distinction 
between  DNC  and  LNC  is  that  with  the  DNC  system,  the 
estimate  of  b  is  obtained  directly  from  Y,  while  with  LNC  it 
is  found  by  independently  demodulating  the  two  source  signals 
and  then  combining  them. 

During  the  broadcast  phase,  the  relay  encodes  and  mod- 
ulates u  using  the  function  Tr{-),  which  may  be  different 
than  the  function  Ts{-)  used  by  the  sources.  The  channel 
code  applied  by  the  relay  is  a  rate-r2  turbo  code,  yielding 
a  length  Lr  =  K/r2  turbo  codeword.  The  code  rates  ri 
and  r2  used  by  the  sources  and  relays,  respectively,  do  not 
need  to  be  the  same.  In  the  simulation  results,  we  contemplate 
using  a  stronger  code  for  the  MAC  phase  than  the  broadcast 
phase,  i.e.  ri  <  r2-  The  relay  broadcasts  its  encoded  and 
modulated  signal,  which  may  be  represented  in  discrete-time 
by  the  2  X  matrix  X.  The  signal  traverses  two  independent 
fading  channels,  and  the  end  nodes  receive  independently 
faded  versions  of  X:  Zi  at  A/i  and  Z2  at  A/2.  The  end 
nodes  demodulate  and  decode  their  received  signals  using 
the  function  r^^(-),  and  form  estimates  of  u.  Let  u  denote 
the  estimate  at  A/i  and  u  denote  the  estimate  at  A/2.  Next, 
estimates  of  the  transmitted  information  messages  are  formed, 
U2  =  u  0  ui  at  A/i  and  ui  =  u  0  U2  at  A/2.  Since  the 
links  in  the  broadcast  phase  are  conventional  point-to-point 
links,  specific  details  of  the  receiver  formulation  will  not  be 
presented  here.  A  detailed  exposition  of  receiver  design  for 
turbo-coded  CPFSK  systems  in  block  fading  channels  can  be 
found  in  [18]. 

All  of  the  channels  in  the  system  are  modeled  as  block- 
fading  channels.  A  block  is  defined  as  a  set  of  N  symbols 
that  all  experience  the  same  fading  amplitude.  The  duration 
of  each  block  corresponds  roughly  to  the  channel  coherence 
time.  Ideally  both  sources  transmit  with  the  same  carrier 
frequency  fc^  —  fc^-  However,  due  to  instabilities  in  each 
source  node's  oscillator  and  different  Doppler  shifts  due  to 
independent  motion,  it  is  not  feasible  to  assume  that  these 
two  frequencies  are  the  same  at  the  relay  receiver.  At  best, 
the  relay  receiver  could  lock  onto  one  of  the  two  frequencies, 
in  which  case  the  received  phase  of  the  other  signal  would 


drift  from  one  symbol  to  the  next.  To  model  this  behavior, 
we  let  the  phase  shift  within  a  block  vary  independently  from 
symbol  to  symbol. 

The  signal  matrix  X^  transmitted  by  node  Mi  may  be 
partitioned  into  Njj  =  Ls/N  blocks  according  to 


X, 


X 


(1) 


X 


(3) 


where  each  block  X-^\  l<^<A/'5,  isa2xA^  matrix,  and 
Nij  is  assumed  to  be  an  integer.  The  channel  associated  with 
block  X  •  ^  is  represented  by  the  N  x  N  diagonal  matrix 


H 


a, 


X  diag(exp{ji9fi^}, 


where  af-'  is  a  real-valued  fading  amplitude  and  6>^^*^  is  the 

phase  shift  of  the  k^^  symbol.  The  {O^^l}  are  independent  and 

identically  distributed  over  the  interval  [0,  27r).  The  {o^-^^}  are 
normalized  so  that  Si  represents  the  average  energy  of  terminal 
A/i  received  by  the  relay.  The  block  at  the  sampled  output 
of  the  relay  receiver's  matched-filters  is  then 


(4) 


-XfHf 


(5) 


where  N*^^^  is  a  2  x  A/"  noise  matrix  whose  elements  are  i.i.d. 
circularly- symmetric  complex  Gaussian  random  variables  with 
zero  mean  and  variance  A'o- 

III.  Relay  Receiver 

At  the  relay,  each  block  Y*^^^  of  the  channel  observation 
matrix  Y  is  passed  to  a  channel  estimator,  which  computes 
estimates  of  the  ^  and  •  A  full  description  of  the  esti- 
mator is  given  in  Section  IV.  The  fading-amplitude  estimates 
and  channel  observations  are  used  to  obtain  soft  estimates  of 
the  network-and-channel-coded  sequence  b.  The  demodulator 
operates  on  a  symbol-by- symbol  basis,  and  therefore  we 
may  focus  on  a  single  signaling  interval  by  dropping  the 
dependence  on  the  symbol  interval  k  and  the  block  index  i. 
Let  bi  and  62  be  the  turbo-coded  bits  transmitted  by  terminals 
A/i  and  A/2,  and  let  6  =  61 062  be  the  corresponding  network- 
coded  bit.  The  relay  demodulator  computes  the  LLR 


A{b)    =  log 


P{b 


P{b  =  0\y) 


i|y) 


P{bi 


o|y) 


(6) 


where  y  is  the  corresponding  column  of  Y.  The  event  {61  0 
62  =  1}  is  equivalent  to  the  union  of  the  events  {61  =  0,  62  = 
1}  and  {bi  =  1,  62  =  0}.  Similarly,  the  event  {bi  0  62  =  0} 
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is  equivalent  to  the  union  of  the  events  {61=0,62  =  0}  and 
{bi  =  1,  62  =  1}.  It  follows  that 


A{b)  =  log 
=  log 


P  {{bi  =  0,  62  =  1}  U  {hi  =  l,b2  =  0}|y) 
P  {{bi  =  0, 62  =  0}  U  {bi  =  1, 62  =  l}|y) 
P  {{bi  =  0, 62  =  l}|y)  +  P  ({61  =  1, 62  =  0}|y) 
P  {{bi  =  0, 62  =  0}|y)  +  P  ({61  =  1, 62  =  l}|y) 


(7) 


where  the  second  line  follows  from  the  first  because  the  events 
are  mutually  exclusive. 

A.  LNC  Receiver 

In  the  LNC  system,  the  LLR's  of  bi  and  62  are  first 
computed  independently  during  the  orthogonal  time  slots  and 
are  then  combined  according  to  the  rules  of  LLR  arithmetic. 
The  LLR  of  the  signal  sent  from  node  Mi  to  the  relay  is 


log 


P{h  =  l\y) 
P{h  =  0|y) 


(8) 


where  y  is  the  signal  received  during  the  time  slot  that  node 
A/i  transmits.  When  the  fading  amplitudes  ai^i  =  1,2,  are 
known,  but  the  phases  Oi^i  =  1,2,  are  not  known,  then  (8)  is 
found  using  [19] 


A(5,) 


log/o 


2ai|y2| 


log/o 


2ai|t/i| 
No 


(9) 


where  /o(-)  is  the  zeroth-order  Bessel  function  of  the  first 
kind  and  yi  and  1/2  are  the  components  of  y.  If  the  fading 
amplitudes  are  not  known,  but  have  Rayleigh  distributions, 
then  (8)  is  found  using  [19] 


1  +  £i/No 


{\y2?-\yi?]- 


(10) 


Once  the  individual  LLR's  from  each  end  node  are  found 
using  (9)  or  (10),  the  LLR  of  the  LNC  system's  network 
codeword  can  then  be  found  from  (7)  and  the  independence 
of  61  and  62  when  y  is  given: 


Kib)  =  lo, 


oA(62) 


I  ^  gA(6i)+A(62) 

=  max*  [A(6i),  A(62)]  -  max  *  [0,  A(6i)  +  A(62)]  (H) 
where  max*[x,  y]  =  log(e^  +  e^). 


B.  PNC  Receiver 

In  the  PNC  system,  it  is  not  sensible  to  compute  A{bi)  and 
A{b2)  separately.  Instead,  use  (7)  and  assume  that  the  four 
events  are  equally  likely  along  with  B ayes'  rule  to  obtain 

A(6)=log  [p  (y|{6i  =  0, 62  =  1})  +  p  (y|{6i  =  1, 62  =  0})] 

-  log  [p  (y|{6i  =  0,  62  =  0})  +  p  (y|{6i  =  1,  62  =  1})] . 

(12) 

The  computation  of  each  p(y|{^i,^2})  by  the  PNC  relay 
receiver  given  various  levels  of  channel  state  information  is 
the  subject  of  the  remainder  of  this  section. 


1)  Coherent  PNC  Receiver:  When  the  fading  amplitudes 
and  phases  are  known,  p(y|{^i,^2})  is  conditionally  Gaus- 
sian. The  mean  is  a  two-dimensional  complex  vector  whose 
value  depends  on  the  values  of  {61,  62}  and  the  complex  fading 
coefficients  {/ii,/i2},  which  are  the  corresponding  entries  of 
the  H  matrix.  Let  m[6i,  62]  be  the  mean  of  y  for  the  given 
values  of  bi  and  62-  When  bi  7^  62,  the  two  terminals  transmit 
different  frequencies  and 


m[0,l]  =  [hi  h2  ]^ 
m[l,0]    =    [h2  hiY. 


(13) 


When  bi  =  62,  the  two  terminals  transmit  the  same  frequency 
and 


m[0,0]  =  [  (/ii  +  /12)  0  ]^ 
m[l,l]    =     [  0    (/11  +  /12)  ]^ 


(14) 


Since  there  is  a  one-to-one  correspondence  between  the  event 
{61,  62}  and  the  mean  vector  m[6i,  62],  it  is  equivalent  to  write 

P  (y|{^i,  ^2})  as  p  (y|m[6i,  62]),  where 

p{y\m[bM)=  (^)  exp|-i-||y-m[6i,62]||'}. 

(15) 

The  coherent  receiver  computes  each  of  the  p(y|{^i,^2}) 
required  by  (12)  by  substituting  the  corresponding  m[6i,62] 
defined  by  (13)  and  (14)  into  (15). 

2)  Noncoherent  PNC  Receiver  with  CSI:  Suppose  that 
the  receiver  does  not  know  the  phases  of  the  elements  of 
the  complex- valued  m[6i,62]  vectors,  but  does  know  the 
magnitudes  of  the  elements.  The  knowledge  of  the  magnitudes 
constitutes  a  type  of  channel- state  information  (CSI).  Define 
62]  to  be  the  two-dimensional  real  vector  whose  elements 
are  the  magnitudes  of  the  elements  of  the  complex  vector 
m[6i,  62].  When  bi  7^  62,  both  frequencies  are  used,  and 

Ax[0,l]    =    [  \hi\    \h2\  [         a2  Y 

Ax[l,0]    =    [  \h2\    M  [  ^2    ai  ]^  (16) 

When  bi  =62,  only  one  frequency  is  used,  and 

Ax[0,0]    =    [1^1  +  ^2!    0       =  [  a    0  ]^ 
m[1,1]    =    [0    I/11  +  /12I       =  [  0    a]^  (17) 


where  a  =      +  /12I  =  2ol\ol2  cos(^2  —  ^1). 

The  pdf  of  y  conditioned  on  /L6[6i,62]  may  be  found  by 
marginalizing  over  the  unknown  phases 

/'27r  /'27r 

p(y|/Lfc[6i,62])  =  /      /  p((/)i,(/)2)p(y|m[6i,62])(i(/)i(i(/)2. 

(18) 


0  ^0 


where  (\)\  and  02  are  the  phases  of  the  first  and  second 
elements  of  m[6i,62],  respectively. 

Assume  that  the  oti  are  Rayleigh  distributed  so  that  the 
hi  are  circularly- symmetric  zero-mean  complex  Gaussian*. 

*  The  receiver  derived  in  this  subsection  is  valid  even  for  non-Rayleigh  fad- 
ing, provided  that  the  received  phases  over  the  two  channels  are  independent 
and  uniform  over  (0,  27r). 
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When  bi  ^  62  each  element  of  m[6i,62]  is  a  circularly- 
symmetric  zero-mean  complex  Gaussian  and  therefore  has 
uniform  phase.  On  the  other  hand,  when  h\  =  62,  one  element 
is  hi-\-h2,  which  is  the  sum  of  two  circularly- symmetric  zero- 
mean  complex  Gaussians,  while  the  other  element  is  zero. 
Since  the  sum  of  two  circularly- symmetric  complex  Gaussians 
is  also  a  circularly- symmetric  complex  Gaussian,  it  follows 
that  hi  +  /i2  is  a  zero  mean  circularly- symmetric  complex 
Gaussian  and  therefore  its  phase  is  uniform.  Since  the  other 
element  is  zero,  its  phase  is  irrelevant  and  may  be  set  to  any 
arbitrary  distribution,  which  is  most  conveniently  chosen  to 
be  uniform.  Thus,  it  follows  that  0i  and  02  are  i.i.d.  uniform. 
Therefore,  the  pdf  conditioned  on  the  magnitudes  is 


My|M[&i,&2]^^y^  expj  1, 


-/ 

'No  Jo 


\y2-H2[bub2]e^'^^\'' 
No 


(19) 


where  Hk[bi,b2]  is  the  fc*''  element  of  /Lt[6i,62]  and 

^jo   No — 


exp 


{ 


No 


No 


(20) 


Substituting  (20)  into  (19), 


k=l  ^ 


No 


where 


(22) 


which  is  common  to  all  four  {61,  62}  and  will  therefore  cancel 
in  the  LLR  (12). 

For  each  event  {61,  62},  substitute  the  p  (y|/[x[6i,  62])  given 
in  (21)  with  the  ^^[61,62]  given  by  (16)  and  (17)  as  the 
corresponding  p  (y  I  {61,  62})  in  (12).  This  results  in 


K{h)  =  lo, 
-log 


;/iVoj^  ( 2^lHA  g-«?/iVoj^  ^2ai|y2 


No 
2a\y2 
No 


(23) 


As  discussed  in  Section  IV,  it  is  possible  to  accurately  esti- 
mate ai  and  0^2  in  the  considered  block  fading  environment, 
provided  the  blocks  are  sufficiently  long.  However,  it  is  not 
generally  feasible  to  precisely  estimate  a  because  the  phases 
Oi  and  62  are  varying  on  a  symbol-by- symbol  basis.  Since 
E[cos{62  —  6>i)]  =  0,  a  reasonable  approximation  when  an 
estimate  of  a  is  not  available  is  to  use 


(24) 


3)  Noncoherent  PNC  Receiver  without  CSI:  Suppose  that 
besides  not  knowing  the  phases  6>i ,  6^2 ,  the  relay  receiver  does 
not  know  the  magnitude  vector  62]-  Then,  the  relay  must 
operate  without  any  channel  state  information  except  for  the 
average  energies  81^62  and  the  noise  variance  A^o-  When  the 
magnitudes  62]  are  not  known,  then  the  conditional  pdf 
is  found  by  marginalizing  (21)  over  the  unknown  magnitudes 

P(y|{^l,^2})    =     /        /  p(/il,/i2)p(y|M[^l,^2])c^/ilC^/i2. 

Jo  Jo 

(25) 

where  /ii  and  /i2  are  the  magnitudes  of  the  first  and  second 
elements  of  ^^[61,62],  respectively. 

According  to  (16),  when  bi  ^  62,  one  of  the  /i^  =  ai  while 
the  other  /i/^  =  0^2.  Since  ai  and  a2  are  independent  and  each 
ai  is  Rayleigh  with  energy  Si,  it  follows  that  the  joint  pdf  of 
/ii  and  fi2  when  (61,  62)  =  (0, 1)  is 


P(M1,M2) 


2^1 
Si 


■  exp 


{-i})(^-{-f} 


(26) 


for  /ii,  /i2  >  0,  and  when  (61,  62)  =  (I,  0)  it  is 

f    /ii  1  W2/i2 


P(M1,M2) 


S2 


■  exp 


{-|})(lr-{-|} 


(27) 


for  fii,fi2  >  0.  Substituting  (26)  and  (21)  into  (25)  yields 


P(y|{&i  =  0,62  =  1}) 


■log 


\yi\ 


£i£2 


£1 
1 

si 


^No 
1 

iVo 


£2 
1 


-No 
1 

iVo 


1  -1 


(28) 


Similarly,  substituting  (27)  and  (21)  into  (25)  yields 


p(y|{6i  =  1,62  =  0}) 


1^1 1 


1^2  I 


log 


1 


Nl 
S2 

1 


^No 
1 

N~o 


Nl 
Si 


1  -1 


(29) 


•A^o 
1  1 

A£2j  \£i  '  No  J  ys^^m. 

As  indicated  by  (17),  when  hi  =  62,  one  of  the  jik  =  ol 
while  the  other  (ik  =  0.  As  discussed  below  (18),  in  a 
Rayleigh-fading  environment,  hi  and  /12  are  independent, 
complex-valued,  circularly- symmetric  Gaussian  variables,  and 
therefore  h  =  hi  -\-  h2  is  also  a  complex- valued,  circularly- 
symmetric  Gaussian  variable.  It  follows  that  a  =  \h\  is 
Rayleigh  with  energy  Si  +  ^2,  and  the  pdf  of  the  nonzero 
Mfc  is 


exp  ' 


/ife>0.  (30) 


f  1  +  f  2  I  ^1  +  f  2  . 
For  the  /ifc  =  0,  its  pdf  may  be  represented  by  an  impulse  at 
the  origin,  i.e.  p{jJ^k)  =  ^(/^/c)-  Substituting  these  pdfs  with 
the  appropriate  ^^[61,62]  into  (25)  yields 

r/  1  \  /  1      1  M-i 

P{y\{bi,b2})    =  log 


1 


Si +£2  J 

\yi\' 


1 

iVo 


A/-2 


A^n 


(31) 
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where  i  =  1  when  (61,  62)  =  (0,  0)  and  i  =  2  when  (61,  62)  = 
(1,1). 

Substituting  (28)  and  (29)  for  the  two  bi  ^  62  and  (31)  for 
the  two  bi  =  62  into  (12)  yields 


A{b)  =  \og 


+  log 
-  log 


exp 
exp 


66 

\yi\ 


I      ?        No  No        i  I 

16       6  J         16       6  J 


where     =  Si  ^  No,  6  =  ^2  +  A^o,  and  ^  =  ^1  +  ^2  +  A^o- 

IV.  Channel  Estimator 

The  goal  of  the  channel  estimator  is  to  estimate  the  values 
of  the  fading  amplitudes  ai  and  0^2  for  a  particular  fading 
block.  Let  the  fading  amplitudes  of  a  block  be  represented 
by  the  pair  {A,  B},  where  A  >  B.  Thus,  A  =  max{ai,  0^2} 
and  B  =  min{ai,  0^2}.  Note  that  in  (23),  exchanging  ai  and 
a2  does  not  change  the  final  expression.  Therefore  (23)  is 
commutative  in  ai  and  0^2,  and  may  be  written  as 


A(6) 


—  max* 


0^2  has  been  used  and 


where  the  approximation  a  ^ 
F{x)  =  log[/o(x)],  which  may  be  efficiently  and  accurately 
computed  through  the  following  piecewise  polynomial  fit: 

F(x)  =  log[/o(x)]^ 

0.22594x2  +  0.012495X  -  0.0011272  0  <  x  <  1 

0.12454x2  +  0.21758X  -  0.10782  1  <  x  <  2 

0.028787x2  +  0.63126X  -  0.56413  2  <  x  <  5 

0.003012x2  +  0.88523X  -  1.2115  5  <  x  <  15 

0.00053203x2  +  0.95304x  -  1.6829  15  <  x  <  30 

0.00013134x2  +  0.97674X  -  2.0388  30  <  x  <  60 

0.9943X  -  2.6446  60  <  x  <  120 

0.99722X  -  3.0039  120  <  x  <  500 

0.99916X  -  3.6114  x  >  500. 

(34) 


A.  Fading  Amplitude  Estimator 

To  estimate  A  and  B,  first  add  the  two  elements  of  each 
to  obtain 

U    =    Vi^i  +  ^i,2  =  hi^i  +  hi^2  +       +  ^i,2  (35) 

^  V  ' 

where  vi  is  circularly- symmetric  complex  Gaussian  noise  with 
variance  2A/'o,  and  hi^}^  is  the  channel  coefficient  between 
terminal  A//e,  /c  =  {1,  2},  and  the  relay  during  the  i^^  signaling 
interval.  The  signal  vi  is  the  noisy  sum  of  two  complex  fading 


coefficients,  and  therefore  the  fading-amplitude  estimation 
algorithm  proposed  by  Hamkins  in  [20]  may  be  used.  To 
determine  the  values  of  A  and  B,  a  system  of  two  equations 
with  two  unknowns  is  required.  The  first  equation,  found  by 
taking  the  expected  value  of  \ri^  under  the  assumption  that 
the  fading  amplitudes  are  fixed  for  the  block  in  question,  is 

E  \\Ti\^\    =    E[al^al^  2aia2  cos(l9i,2  -  6>i,i)] 

=    E  [al^  al]  =al^al=  A^  ^  5^  (36) 

The  second  equation  is  found  by  conditioning  on  the  event 
{|rp  >  which  is  equivalent  to  {cos(6>i,2-6>i,i)  >  0} 

and  has  expected  value  [20] 

4.AB 


E 


>  A^  ^B' 


B' 


(37) 


Solving  (36)  and  (37)  for  A  and  B  yields 
1 


B 


X 


■X)- 


X 


Y) 


X 


■X) 


X 


Y) 


(38) 


where  X  =  E  [|rp]  and  F  =  £■ 

Since  the  expected  values  required  for  (38)  are  not  tnown, 
they  may  be  estimated  by  using  the  corresponding  statistical 


averages, 


X 


Y 


1  ^ 

-Y 


N 
2 

TV 


i:|r,|2>X 


(39) 


where  N  is  the  size  of  the  fading  block  and  the  factor  2/N 


used  to  compute  Y  assumes  that 


>  X  for  approxi- 


mately N/2  symbols.  If  this  assumption  is  not  true,  then 
the  multiplication  by  2/N  can  be  replaced  with  a  division 
by  the  number  of  samples  that  satisfy  \ri\^  >  X.  As  an 
alternative  to  summing  over  the  Ir^p  >  X,  Hamkins  proposes 
summing  over  those  \ri\^  greater  than  the  median  value  of 
{|ri|2,...,|r^|2}  [20]. 

The  estimator  works  by  computing  estimates  X  and  Y  using 
(39)  and  the  {ri,...,rAr}  for  the  block.  These  estimates  are 
used  in  place  of  X  and  Y  in  (38),  which  yields  estimates  A 
and  B  of  A  and  B.  These  estimates  are  then  used  in  place  of 
A  and  B  in  (33). 

B.  Transmission-Case  Detection 

According  to  (35),  the  two  elements  of  are  always  added 
together.  When  6i  =  62,  only  one  tone  is  used,  and  the  noise 
can  be  reduced  if  the  receiver  processes  only  the  tone  used  and 
ignores  the  other  tone.  This  requires  that  the  receiver  be  able 
to  detect  whether  the  first  tone,  the  second  tone,  or  both  tones 
were  used,  which  may  be  implemented  using  a  variation  of  the 
"no-CSI"  receiver  described  in  subsection  III-B3.  In  [16],  we 
contemplate  an  estimator  that  uses  such  a  transmission-case 
detector.  However,  we  found  that  the  performances  with  and 
without  the  transmission-case  detector  were  virtually  identical 
and  do  not  consider  it  further  in  this  paper.  At  best,  proper  use 
of  the  transmission-case  detector  reduces  the  noise  variance 
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from  27Vo  to  Nq  during  the  symbol  intervals  that  both  nodes 
transmit  the  same  tone.  As  will  be  seen  in  the  numerical 
results,  the  estimator  is  resilient  enough  against  noise  that  this 
reduction  in  noise  variance  is  not  meaningful  and  does  not 
justify  the  additional  complexity. 

C.  Amplitude  Estimation  for  Single -Transmitter  Links 

During  the  broadcast  phase,  there  is  only  a  single  trans- 
mission, and  the  dual- amplitude  estimator  described  in  sub- 
section IV-A  is  not  necessary.  Similarly,  the  estimator  is  not 
needed  by  the  LNC  system  during  the  MAC  phase  since  the 
two  transmissions  are  over  orthogonal  channels.  To  estimate 
the  fading  amplitudes  for  the  links  involving  only  a  single 
transmitter  and  receiver,  the  simple  averaging  technique  given 
by  (29)  in  [21]  is  used,  which  is  described  as  follows.  Consider 
the  i^^  signaling  interval  during  the  i^^  fading  block.  Given 
transmission  of  tone  k,  in  the  absence  of  noise,  the  k^^ 
matched-filter  output  at  the  receiver  is  i/k^i  =  ae^^\  and  has 
magnitude  \yk,i\  =  All  other  matched-filter  outputs  in  the 
j^th  signaling  interval  are  0.  An  estimate  could  be  formed  by 
taking  the  maximum  \yk,i\  over  any  column  of  Y^.  In  the 
presence  of  noise,  an  estimate  of  a  can  be  formed  by  averaging 
across  all  columns  of  the  fading  block 

1  ^ 

i=l 

V.  Simulation  Study 

This  section  presents  simulated  performance  results  for  the 
relay  receiver  described  in  Section  III.  The  simulated  link 
model  is  as  described  in  Section  II,  with  specific  simulation 
parameters  given  in  the  following  subsections.  The  goal  of 
the  simulations  is  to  compare  the  performance  of  comparable 
DNC  and  LNC  systems  and  to  assess  the  robustness  of  the 
channel  estimator  proposed  in  IV.  Because  the  relay-broadcast 
phase  of  the  DNC  and  LNC  systems  operate  in  exactly  the 
same  manner  and  have  the  same  performance,  we  only  focus 
on  the  performance  of  the  MAC  phase. 

A.  Uncoded  Performance  with  Perfect  Channel  Estimates 

We  initially  consider  a  system  that  does  not  use  an  outer 
error-correcting  code,  and  thus  =  u^,  z  =  1,  2.  We  compare 
the  performance  of  the  LNC  and  DNC  systems.  With  the  LNC 
system,  the  two  nodes  transmit  their  messages  in  orthogonal 
time  slots  and  the  relay  receiver  first  generates  the  individual 
LLR's  during  each  time  slot  using  either  (9)  or  (10),  and 
then  the  two  LLR's  are  combined  using  (11).  When  there 
is  no  outer  error-correcting  code,  performance  using  (9)  is 
approximately  the  same  as  that  using  (10).  A  bit  error  is 
declared  at  the  relay  whenever  a  hard  decision  using  (11) 
results  in  an  erroneous  decision  on  the  corresponding  bit  of  the 
network  codeword  b.  Such  an  error  will  usually  occur  if  one 
of  the  two  bits  bi ,  62  is  received  incorrectly,  and  therefore  the 
error  rate  of  the  LNC  system  is  approximately  P5  ^  2p{l—p) 
where  p  is  the  bit  error  rate  of  noncoherent  binary  FSK 
modulation  [17]. 

With  the  DNC  system,  the  two  nodes  transmit  simultane- 
ously, and  the  relay  receiver  computes  the  LLR  using  (23) 


VDNC:  No  CSI 


E,/N^(dB) 


Fig.  3.  Bit  error  rate  at  the  relay  in  Rayleigh  fading  when  DNC  and  LNC  is 
used  and  S2  =  Si.  Depending  on  the  amount  of  channel  state  information  that 
is  available,  the  PNC  system  will  use  one  of  three  different  relay  receivers. 

when  the  magnitudes  iJ,[bi^b2]  are  known  or  (32)  when  they 
are  not.  A  hard  decision  is  made  on  the  LLR  and  a  bit 
error  is  declared  if  the  estimate  of  the  corresponding  network 
codeword  bit  b  is  incorrect.  We  assume  that  the  channel 
estimates  are  perfect,  and  since  there  is  no  error-correction 
coding,  the  size  of  the  fading  block  is  irrelevant  provided  that 
the  channel  coherence  time  is  not  exceeded. 

Initially,  we  set  the  average  received  energy  to  be  the  same 
over  both  channels,  i.e.  £2  =  £1  =  £s  =  £b-  Fig-  3  shows  the 
performance  of  the  LNC  and  DNC  systems  in  Rayleigh  fading 
with  equal  energy  signals.  As  anticipated,  the  LNC  system 
offers  the  best  performance,  which  is  approximately  3  dB 
worse  than  a  standard  binary  CPFSK  system  with  noncoherent 
detection  (the  loss  relative  to  conventional  CPFSK  is  due  to  the 
fact  that  both  bits  must  usually  be  received  correctly).  Three 
curves  for  the  DNC  system  are  shown  in  Fig.  3,  corresponding 
to  receivers  that  exploit  different  amounts  of  available  channel 
state  information.  The  best  performance  is  achieved  using  a 
receiver  implemented  with  (23),  which  requires  knowledge 
of  ai,a2,  and  a.  The  performance  of  the  DNC  system 
implemented  with  (23)  is  only  about  0.25  dB  worse  than  that 
of  the  LNC  system.  The  worst  performance  is  achieved  using 
a  receiver  implemented  using  (32),  which  does  not  require 
knowledge  of  the  fading  amplitudes.  The  loss  due  to  using 
(32)  instead  of  (23)  is  about  10  dB,  indicating  that  estimating 
the  fading  amplitudes  at  the  relay  is  necessary. 

While  it  may  be  feasible  to  estimate  ai  and  0^2,  estimating 
a  may  prove  to  be  more  difficult  because  it  will  depend  on 
not  only  the  individual  fading  amplitudes,  but  also  on  the 
phase  difference  between  the  two  channels.  Since  the  phase 
difference  might  change  more  quickly  than  the  individual 
amplitudes,  it  might  not  be  practical  to  estimate  a.  If  that 
is  the  case,  then  the  approximation  given  by  (24)  can  be  used 
in  place  of  the  actual  value  of  a.  The  performance  using  this 
technique  is  also  shown  in  Fig.  3  and  shows  a  loss  of  about  3 
dB  with  respect  to  the  known-/[x[6i,  62]  system,  which  requires 
knowledge  of  a. 

The  performance  of  DNC  is  sensitive  to  the  balance  of 
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V  DNC:  No  CSI 

*  DNC:  known  a  ,  a 


O  DNC:  known  a^,  a^,  a 


Fig.  4.  Bit  error  rate  at  the  relay  in  Rayleigh  fading  of  DNC  with  three 
different  receivers  and  either  S2  =  Si  (solid  line)  or  S2  =  (dashed 
line). 

power  received  over  the  two  channels.  Performance  is  best 
when  £1  =  £2-  In  order  to  evaluate  how  robust  the  DNC  relay 
receivers  are  to  an  imbalance  of  power,  the  simulations  were 
repeated  with  £2  =  4fi,  while  keeping  =  £s  =  (^i+^2)/2. 
These  results  are  shown  in  Fig.  4  for  the  three  receiver 
formulations  that  were  considered  in  the  previous  figure. 
When  the  power  is  imbalanced  in  this  way,  there  is  a  loss 
of  about  2  dB.  However,  the  loss  is  the  same  for  all  three 
receiver  implementations,  suggesting  that  they  are  robust  to 
an  imbalance  of  power. 


B.  Uncoded  Performance  with  Channel  Estimation 

We  now  consider  the  influence  of  channel  estimation,  but 
still  assume  that  the  system  does  not  use  error-correction 
coding.  In  the  simulations,  the  information  frames  generated 
at  the  end  nodes  contain  K  =  2048  bits  per  frame.  The  fading 
blocks  are  length  N  =  {8,  32, 128}  symbols  per  block.  The 
DNC  relay  implements  (32)  and  then  makes  a  hard  decision 
on  each  information  bit. 

The  bit  error-rate  performance  of  the  uncoded  system  is 
shown  in  Fig.  5.  The  performance  is  shown  with  the  estimator 
using  the  three  block  sizes  N  =  {8,  32, 128}  as  well  as  for 
the  case  of  perfect  estimates  of  ai  and  a2.  A  narrow  range 
of  error  rates  is  shown  to  better  highlight  the  differences  in 
performance.  In  general,  smaller  fading  blocks  lead  to  a  less 
accurate  estimation  of  the  fading  amplitudes,  as  the  number 
of  samples  available  for  estimation  decreases.  Moving  from 
block  size  N  =  128  to  32  worsens  performance  by  roughly 
0.25  dB,  and  from  TV  =  32  to  8  by  0.75  dB. 

C.  Performance  with  an  Outer  Turbo  Code 

Now  consider  a  system  that  uses  an  outer  turbo  code. 
The  terminals  each  encode  length  K  =  1229  information 
sequences  into  length  L  =  2048  codewords,  using  a  rate 
ri  ^  0.6  UMTS  turbo  code  [22].  The  relay  performs  turbo 
decoding  using  the  codeword  LLR's  computed  by  (32).  The 
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□  N=32 
*  N=8 

O  Perfect  Estimation 


Fig.  5.  Influence  of  fading-block  length  on  uncoded  DNC  error-rate 
performance  at  the  relay.  In  addition  to  curves  for  three  values  of  A^,  a  curve 
is  shown  indicating  the  performance  with  perfect  fading-amplitude  knowledge. 
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Fig.  6.     Influence  of  fading-block  length       on  turbo-coded  DNC  error- 
rate  performance  at  the  relay.  Two  curves  are  shown  for  each  value  of 
=  {8,  16,  32,  64,  128}.  Solid  curves  denote  perfect  fading-amplitude 
knowledge.  Dashed  curves  denote  estimated  fading  amplitudes. 

fading-block  lengths  simulated  are  N  =  {8,16,32,64,128} 
symbols  per  block. 

The  error  performance  of  the  coded  system  is  shown  in 
Fig.  6,  both  with  perfect  channel  estimates  and  with  estimated 
fading  amplitudes.  A  good  tradeoff  between  diversity  and 
estimation  accuracy  is  achieved  for  block  sizes  N  =  16 
and  N  =  32,  which  exhibit  the  best  performance  of  all 
systems  that  must  estimate  the  fading  amplitudes.  For  <  16 
performance  degrades  due  to  the  lack  of  enough  observations 
per  block  for  accurate  channel  estimates,  while  for  >  32 
performance  degrades  due  to  the  reduction  in  time  diversity. 

Fig.  7  shows  the  SNR  required  to  reach  an  error  rate  of 
10""^  at  the  relay  as  a  function  of  the  block  length  A^.  In  each 
case,  information  is  coded  with  the  same  (2048, 1229)  turbo 
code  used  for  Fig.  6.  Curves  for  three  systems  are  shown:  The 
noncoherent  receiver  with  known  {ai,Qf2},  the  noncoherent 
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Fig.  7.  Signal-to-noise  ratio  required  to  reach  a  bit  error  rate  of  10~^  at  the 
relay  as  a  function  of  fading-block  length.  The  performance  of  three  systems 
is  shown:  The  noncoherent  receiver  with  known  {0:1,0:2},  the  noncoherent 
receiver  with  estimated  {01,  02},  and  the  noncoherent  receiver  that  does  not 
use  CSI.  All  systems  use  a  Turbo  code  with  rate  1229/2048. 
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Fig.  8.  Comparison  of  error-rate  performance  between  the  turbo-coded  DNC 
and  LNC  systems  at  the  relay.  The  solid  lines  denote  DNC,  while  the  dashed 
lines  denote  LNC. 

receiver  with  estimated  {ai,  0^2},  and  the  noncoherent  receiver 
that  does  not  use  CSI.  When  {ai,a2}  are  not  estimated, 
performance  improves  with  decreasing  N  because  of  the 
increased  number  of  blocks  per  codeword,  which  increases 
the  time  diversity.  However,  when  {a  1,0^2}  are  estimated, 
the  performance  gets  worse  when  the  block  size  is  smaller 
than  N  =  16.  The  loss  of  time  diversity  as  the  block  size 
increases  is  a  common  problem  for  any  system  operating  over 
a  slow-fading  channel,  and  the  system  proposed  in  this  paper 
is  no  exception.  The  performance  gap  between  the  known- 
CSI  and  no-CSI  receiver  formulations  widens  with  increasing 
block  length. 

An  error-rate  performance  comparison  between  DNC  and 
LNC  is  shown  in  Fig.  8.  Both  systems  use  the  same 
(2048, 1229)  turbo  code.  The  LNC  system  outperforms  the 
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Fig.  9.  Comparison  of  the  performance  of  turbo-coded  DNC  and  LNC  at 
the  relay  with  block  size  =  32.  For  the  DNC  system,  two  code  rates  are 
shown,  with  the  lower  rate  code  offering  comparable  performance  to  the  LNC 
system. 

DNC  system  by  margins  ranging  between  4  and  6  dB. 

While  the  LNC  system  is  more  energy  efficient  than  the 
DNC  system  when  the  same-rate  turbo  code  is  used,  the 
throughput  of  the  LNC  system  is  worse  than  that  of  the  DNC 
system  because  the  two  terminals  must  transmit  in  orthogonal 
time  slots.  The  loss  in  energy  efficiency  from  using  DNC  ver- 
sus LNC  can  be  recovered  by  having  the  source  terminals  use 
a  lower-rate  turbo  code.  Consider  the  performance  comparison 
shown  in  Fig.  9  for  block  size  N  =  32.  At  Ej,/No  ^  24  dB, 
DNC  using  a  rate  ri  =  4500/6400  code  matches  the  error- 
rate  performance  of  LNC  using  a  rate  ri  =  4500/5056  code. 
Because  the  two  terminals  transmit  at  the  same  time,  the  end- 
to-end  throughput  of  DNC  is  higher  than  that  of  LNC,  even 
though  the  DNC  terminals  transmit  to  the  relay  with  a  lower- 
rate  channel  code. 

To  illustrate  the  throughput  improvement  of  DNC  over 
LNC,  consider  the  following  transmission  schedule  for  the 
two  systems.  Assume  the  source  terminals  use  rate  ri  = 
4500/6400  in  DNC,  and  n  =  4500/5056  in  LNC.  Assume 
operation  at  Ei^/Nq  =  24  dB,  yielding  approximately  equal 
relay  error-rate  performance.  Further,  assume  that  both  sys- 
tems use  code  rate  r2  =  4500/5056  for  relay  broadcast, 
yielding  approximately  equal  end-to-end  performance.  DNC 
requires  6400  channel  uses  for  transmission  to  the  relay  versus 
2  X  5056  =  10112  for  LNC.  Both  systems  require  5056 
channel  uses  for  relay  broadcast.  The  throughput  for  DNC 
is  thus  T^D^c)  ^  9000/(6400  +  5056)  =  9000/11,456  bits 
per  channel  use,  and  for  LNC  T^^^nc)  ^  9000/(3  x  5056)  = 
9000/15, 168  bits  per  channel  use.  The  percentage  throughput 
increase  of  DNC  over  LNC  is  thus  (T^^^c) ^j^iLNC) 
100  ^  32.4%. 

VL  Conclusion 

A  throughput-improving  technique  for  relaying  in  the  two- 
way  relay  network,  digital  network  coding,  is  refined  for  prac- 
tical operation.  The  system  operates  noncoherently,  providing 
advantages  over  coherent  operation:  there  are  no  requirements 
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for  perfect  power  control,  phase  synchronism,  or  estimates  of 
carrier-phase  offset. 

A  computationally  simple  technique  for  estimating  fading 
amplitudes  at  the  relay  is  implemented.  Error-rate  performance 
in  the  noncoherent  Rayleigh  block-fading  channel  at  several 
block  sizes  is  presented.  The  system  is  simulated  with  and 
without  an  outer  error-correcting  code.  The  coded  error-rate 
performance  of  the  system  using  estimation  differs  from  that 
with  ideal  estimates  by  margins  between  0.7  —  1.5  dB. 

When  the  same-rate  turbo  code  is  used,  digital  network 
coding  has  a  higher  throughput  but  lower  energy-efficiency 
than  link-layer  network  coding  .  The  energy  loss  of  DNC  can 
be  recovered  by  using  a  lower-rate  turbo  code  during  the  MAC 
phase.  Even  when  the  loss  of  spectral  efficiency  due  to  the 
lower-rate  turbo  code  is  taken  into  account,  the  DNC  system 
is  able  to  achieve  a  higher  throughput  than  LNC  at  the  same 
energy-efficiency.  In  the  particular  example  presented  in  this 
paper,  the  DNC  system  is  capable  of  achieving  throughputs 
that  are  32.4%  larger  than  that  of  the  equivalent  LNC  system, 
while  operating  at  the  same  energy  efficiency. 
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We  study  the  transport  of  information  between  two  complex  systems  with  similar  properties.  Both  systems 
generate  non-Poisson  renewal  fluctuations  with  a  power-law  spectrum  \/f^~^,  the  case  fi  =  2  corresponding  to 
ideal  l/f  noise.  We  denote  by  /xs  and  /xp  the  power-law  indexes  of  the  system  of  interest  S  and  the  perturbing 
system  P,  respectively.  By  adopting  a  generalized  fluctuation-dissipation  theorem  (FDT)  we  show  that  the  ideal 
condition  of  1//  noise  for  both  systems  corresponds  to  maximal  information  transport.  We  prove  that  to  make 
the  system  S  respond  when  /x^  <  2  we  have  to  set  the  condition  /xp  <  2.  In  the  latter  case,  if  /xp  <  /X5,  the 
system  S  inherits  the  relaxation  properties  of  the  perturbing  system.  In  the  case  where  /xp  >  2,  no  response 
and  no  information  transmission  occurs  in  the  long-time  limit.  We  consider  two  possible  generalizations  of 
the  fluctuation  dissipation  theorem  and  show  that  both  lead  to  maximal  information  transport  in  the  condition 
of  1  //  noise. 

DOI:  10.1 103/PhysRevE.83.051 130  PACS  number(s):  05.40.-a,  87.18.Tt,  87.19.1m,  89.70.Hj 


I.  INTRODUCTION 

Linear  response  theory  (LRT)  [1]  is  one  of  the  basic  ways 
of  obtaining  information  from  fluctuations  in  nonequilibrium 
statistical  physics  [2,3]  that  is  currently  adopted  to  address  new 
phenomena  such  as  glassy  systems  [2]  and  granular  matter  [3]. 
An  even  more  challenging  issue  is  the  application  of  LRT 
to  complex  processes  such  as  physiological  processes  and 
especially  the  understanding  of  brain  dynamics. 

It  is  becoming  more  widely  accepted  that  the  brain  operates 
at  criticality  [4,5]  and  that  the  critical  condition  of  a  phase 
transition  has  manifestations  that  extend  beyond  the  conven- 
tional condition  of  temperature  driven  systems  [6] .  Frantsuzov 
et  al  [7]  adopted  a  model  of  cooperatively  interacting  units  to 
propose  a  solution  to  the  long-standing  mystery  of  the  origin 
of  the  power-law  distribution  of  the  blinking  times  in  colloidal 
quantum  dot  fluorescence.  On  the  other  hand,  this  form  of 
intermittence  is  characterized  by  the  condition  of  renewal 
aging  [8]  and  by  the  consequent  ergodicity  breakdown  [9]  that 
makes  it  impossible  to  use  conventional  LRT.  We  refer  to  these 
systems  as  complex  systems.  It  is  now  very  well  understood 
[10]  that  the  breakdown  of  the  ergodic  condition  is  caused  by 
the  occurrence  of  crucial  events.  It  is  interesting  to  remark 
that  the  same  perspective  applies  to  complex  networks,  when 
the  cooperation-induced  phase  transition  produces  temporal 
complexity  and  crucial  events  as  illustrated  in  in  the  work  of 
Ref.  [11].  These  events  are  renewal,  namely,  the  time  interval 
between  two  consecutive  events  does  not  have  any  relation 
whatsoever  with  the  earlier  or  later  time  intervals  between 
two  consecutive  events.  Yet,  if  a  Gibbs  ensemble  is  suitably 
prepared  miiidiXXy,  namely,  in  all  the  systems  of  the  ensemble  an 
event  occurs  at  the  time  origin,  then  the  rate  of  event  production 
turns  out  to  be  time  dependent  rather  than  constant  as  in  the 
ordinary  Poisson  case.  The  authors  of  Refs.  [12,13]  applied 
this  theoretical  perspective  to  the  liquid  crystal  dynamics  and 
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experimentally  realized  a  true  cascade  of  renewal  events.  The 
time  interval  between  two  consecutive  crucial  events  is  given 
by  a  waiting  times  probability  density  function  (PDF)  i/^(r) 
with  the  following  asymptotic  form 

Vr(r)  =  (/X  -  1) — —  r,  (1) 

and  the  power-law  index  /x  fulfilling  the  inequality 

1  <  /X  <  3.  (2) 

It  is  important  to  explain  the  origin  of  the  special  form  of 
Eq.  (1).  First  of  all  we  want  to  stress  that  according  to  a  point 
of  view  in  the  field  of  complexity,  only  the  asymptotic  time 
behavior  matters,  namely,  V^(r)al/r^.  The  adoption  of  this 
widely  shared  point  of  view,  as  we  shall  see  hereby,  would 
prevent  us  from  establishing  a  correct  accordance  with  the 
experiments  on  the  response  of  complex  systems  to  external 
perturbations.  Thus,  the  choice  of  Eq.  (1)  is  dictated  by  the 
need  for  defining  a  border  between  the  asymptotic  time  regime 
(r  ^  T)  and  the  microscopic  time  regime  (r  ^  T).  Given  the 
neurophysiology  interest  of  this  paper  and  especially  the  focus 
on  brain  dynamics,  we  refer  the  interested  reader  to  the  work  of 
Ref.  [14],  where  the  waiting  times  PDF  of  Eq.  (1)  is  obtained  by 
means  of  a  Fechner  transformation  [14]  from  the  conventional 
Poisson  distribution. 

The  rate  of  cascade  of  renewal  events  tends  to  a  vanishing 
value  as  \/t^~^  when  fi  <  2  and  to  a  constant  value  as  l/t^~^ 
when  fi  >  2.  It  is  evident  that  in  both  cases  the  time  duration 
of  the  out-of-equilibrium  condition  is  infinite,  thereby  raising 
the  challenging  task  of  going  beyond  conventional  LRT  to 
describe  the  dynamics. 

Conventional  LRT  is  given  by  the  following  expression: 

cr(t)=m))=€  f  dsx(t,s)^p(s),  (3) 
Jo 

where  ^sO)  is  the  fluctuation  produced  by  the  system  of 
interest  S.  The  symbol  (^sO))  denotes  the  Gibbs  average 
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over  the  fluctuations.  In  the  absence  of  perturbation  this 
average  is  assumed  to  vanish.  The  variable  ^p(t)  denotes  the 
time-dependent  perturbation  and  €  its  intensity.  LRT  predicts 
the  response  of  S  on  the  basis  of  the  unperturbed  correlation 
function  of  §5.  In  fact,  the  function  xO^^)^  called  the  linear 
response  function,  is  related  to  the  correlation  function  of  the 
fluctuation  §5,  whose  quadratic  mean  value  is  assumed  for 
simplicity  to  be  normalized  to  unity 

^s(t,s)^(^sms(s)),  (4) 
by  the  following  expression 

d 

X(t,s)=  -—^s(t,s).  (5) 
ds 

Note  that  the  traditional  LRT  refers  to  the  stationary  case 

^s(t,s)  =  ^s(t-s),  (6) 
and  as  a  consequence 

^^lfs(t,s)  =  -^^s(t,s).  (7) 
ds  dt 

This  condition  is  not  fulfilled  by  complex  systems.  For  these 
latter  systems  the  choices  of  linear  response  functions 

X(t,s)  =  ^^s(t,s),  (8) 
ds 

and 

X(t,s)  =  -^^s(t,s),  (9) 
at 

are  not  equivalent. 

The  authors  of  Refs.  [15-18]  have  discussed  the  foundation 
of  both  choices  and  have  established  that  the  new  form  of  LRT 
is  determined  by  the  physical  way  through  which  perturbation 
determines  a  bias.  For  the  sake  of  simplicity  these  authors  have 
made  the  assumption  that  ^sO)  is  a  dichotomous  signal.  Using 
the  jargon  of  turbulence  theory  they  called  the  time  intervals 
between  two  consecutive  crucial  events  laminar  regions.  At  the 
moment  of  a  crucial  event  occurrence,  unperturbed  dynamics 
are  realized  by  the  random  selection  of  either  the  positive, 
§5  =  1,  or  the  negative  value,  §5  =  —  1.  In  other  words,  they 
assume  that  the  occurrence  of  a  crucial  event  generates  the 
tossing  of  a  fair  coin  which  determines  the  sign  of  the  next 
laminar  region.  Consequently,  the  external  perturbation  can 
generate  a  bias  in  two  different  ways.  The  first  way  rests  on 
affecting  the  fairness  of  the  coin  tossing  process.  If  ^p(t)  >  0 
(^p(t)  <  0)  the  choice  of  the  positive  (negative)  sign  is  more 
probable  than  the  choice  of  the  negative  (positive)  sign.  This 
prescription  leads  to  the  choice  of  Eq.  (8)  and  is  denoted  as 
phenomenological  LRT. 

The  experiments  done  by  the  authors  of  Refs.  [12,13]  show 
that  nature  prefers  Eq.  (9),  the  dynamic  LRT.  What  is  the 
theoretical  argument  in  favor  of  the  dynamical  theory?  To 
afford  a  convincing  answer  to  this  important  question,  let  us 
go  back  to  the  special  form  of  Eq.  (1).  We  note  that  we  do 
not  know  the  Hamiltonian  of  our  complex  system,  and  we 
do  not  even  know  if  a  satisfactory  discussion  of  the  complex 
dynamics  can  be  made  using  a  Hamiltonian  formalism.  Let 
us  assume  that  Eq.  (1)  is  a  reliable  representation  for  the 
distribution  length  of  the  laminar  region.  In  this  case,  a 


reasonable  conjecture  is  that  the  external  perturbation  affects 
either  /x  or  T,  or  both  parameters  defining  the  form  of  Eq.  (1). 
We  know  that  defines  the  system's  complexity  and  emerges 
from  the  cooperative  interaction  among  interacting  units.  A 
weak  external  perturbation  is  not  expected  to  change  the 
system's  complexity.  It  is  therefore  reasonable  to  assume  that 
the  external  perturbation  affects  T,  by  enlarging  (reducing) 
its  value  if  and  have  the  same  (opposite)  sign.  This 
assumption  leads  to  the  choice  of  Eq.  (9),  as  shown  earlier 
[15-18]. 

It  is  important  to  notice  that  the  response  of  a  complex 
system  of  the  same  nature  as  the  one  discussed  in  this  paper 
has  been  studied  by  many  authors  [19-28].  These  authors  did 
not  establish  a  connection  between  their  results  and  the  LRT 
of  Refs.  [15-18]  and  in  some  cases  they  made  the  misleading 
conjecture  that  their  results  establish  the  "death  of  LRT." 
Actually,  these  theoretical  treatments  are  asymptotic  in  time 
and  the  only  possible  connection  with  LRT  is  through  the 
adoption  of  the  phenomenological  theory  of  Eq.  (8),  as  the 
readers  can  establish  by  a  careful  reading  of  Ref.  [28]. 

We  are  now  in  a  position  to  define  the  main  purpose  of 
this  paper.  We  draw  the  attention  of  the  readers  to  the  recent 
results  of  the  authors  Ref.  [29].  This  paper  addresses  the 
important  issue  of  studying  the  response  of  a  complex  system 
to  a  complex  external  perturbation  with  the  surprising  result 
that  a  complex  system  does  not  respond  to  stimuli  that  are  not 
complex  (i.e.,  that  have  a  stationary  Fourier  spectrum).  It  is 
important  to  stress  that  the  authors  of  Ref.  [29]  focused  on 
the  correlation  between  ^s(t)  and  ^p{t)  in  the  long-time  limit. 
This  is  an  ideal  condition  that  has  the  effect  of  restraining  the 
definition  of  complexity  to  the  systems  with  /x  ^  2.  In  fact,  in 
the  long-time  limit  a  system  with  /x  >  2  reaches  the  normal 
condition  of  a  constant  rate  of  event  production,  thereby 
recovering  the  ordinary  Poisson  condition.  The  condition 
/X  =  2  is  of  fundamental  importance  for  brain  function.  In 
fact,  recent  work  [30,31]  established  that  the  brain  works 
with  /X  =  2,  which,  in  turn,  is  known  [32]  to  correspond  to 
making  the  brain  action  become  the  source  of  ideal  l/f  noise. 
The  results  of  the  authors  of  Ref.  [29]  may  therefore  have 
important  applications  to  design  the  most  convenient  stimuli 
to  drive  complex  networks,  and  especially  brain  dynamics,  via 
what  was  defined  as  "complexity  management"  [29] .  However, 
an  apparent  weakness  found  in  Ref.  [29]  is  that  these  results 
are  derived  from  the  adoption  of  the  phenomenological  LRT, 
thereby  raising  doubt  that  those  complex  systems,  which 
have  been  proven  to  obey  the  dynamical  LRT  [12,13],  may 
not  obey  the  principle  of  complexity  management  (CM) 
established  in  Ref.  [29].  Herein  we  prove  that  the  more 
realistic  dynamical  LRT  generates  CM.  In  addition  to  this 
main  purpose,  the  present  paper  affords  technical  details  on 
the  theory  developed  in  Ref.  [29]  that,  due  to  space  limitations, 
were  not  conveniently  illustrated. 

II.  FDT  FOR  THE  NONERGODIC  RENEWAL  REGIME: 
PHENOMENOLOGICAL  AND  DYNAMICAL 
APPROACH 

The  authors  of  Refs.  [15,16]  discovered  a  form  of  FDT  that 
applies  to  systems  or  networks  whose  dynamics  are  dominated 
by  non-Poisson  renewal  events.  In  the  stationary  case  this 
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FDT  becomes  indistinguishable  from  the  ordinary  theoretical 
prediction  [1]. 

Herein  we  investigate  the  consequences  of  the  adoption 
of  either  the  "phenomenological"  choice  Eq.  (8)  or  the 
"dynamical"  choice  (9),  in  the  special  case  where  both  ^p(t) 
and  ^sO)  are  event-dominated  processes  and  show  that  the 
transmission  of  information  from  P  to  5  is  determined  by  the 
dialogue  between  the  critical  events  of  ^sO)  and  the  critical 
events  of  ^p(t).  Specifically,  this  discussion  is  devoted  to 
studying  the  transport  of  information  from  P  to  S,  using 
both  forms  of  generalized  FDT  (gFDT).  Note  that  there  is 
no  limitation  on  the  form  of  provided  that  the  coupling  is 
weak  enough  as  to  be  compatible  with  the  emergence  of  the 
linear  response  form  of  Eq.  (3). 

To  discuss  the  problem  of  the  transmission  of  information 
from  P  to  5*,  it  is  convenient  to  imagine  the  ideal  case  of  a  Gibbs 
ensemble  of  composite  systems  S  -\-  P.  Thus,  in  this  paper  we 
imagine  that  P  generates  a  fluctuating  signal  ^p(t)  and  that 
for  any  signal  ^p(t)  there  exists  a  response  ^s(0-  For  each 
perturbing  signal  we  have  to  make  infinitely  many  experiments 
and  average  over  all  possible  responses.  For  simplicity  we  take 
both  signals  ^s(0  and  ^p(t)  to  be  dichotomous  and  fluctuating 
between  values  ±1.  It  is  important  to  remark  that  Eq.  (3),  for 
the  response  of  the  system  S  to  the  perturbation  P,  is  valid 
when  the  system  is  prepared  at  time  ^  =  0  and  placed  at  the 
beginning  of  a  laminar  phase,  and  the  interaction  with  the 
perturbation  P  is  turned  on  at  the  same  time. 

In  the  general  case  of  a  dicothomous  renewal  process  ^(t), 
generated  with  a  waiting-times  PDF  \l/(t),  the  probability 
density  that  fixed  a  time  t\  the  first  next  event  is  observed 
at  time  t  >    is  given  [33]  by 


xif(t/)  =  xif(t)  +  y]  / 

n=l 


R(t")ir{t  -  t")dt\  (10) 


with 


(11) 


n=l 


where  V^„(0  denotes  the  n -times  convolution  of  if{t).  R(t)  is 
therefore  the  probability  density  of  having  an  event  occurring 
exactly  at  time  t. 

It  can  be  shown  as  well,  see  Ref.  [33],  that  the  autocorrela- 
tion function  of  the  process  is  related  to  if{t,t')  by 


dx\lr{x,t')  =  ^(t/), 


(12) 


and  therefore  coincides  with  the  survival  probability  ^(t,t^) 
for  the  first  event  (i.e.,  the  probability  that,  for  fixed  t\  no  event 
is  observed  until  time  t  >  t').  We  assume  that  the  fluctuation 
^sit)  generated  by  5*  is  a  dicothomous  renewal  process  defined 
by  the  probability  density 


i^sit)  =  (fis  -  1): 


(13) 


(t  +  TsY^ 

We  therefore  name,  respectively,  \l/s(t,t^),  Rs(t),  and  ^^(^,^0 
the  functions  obtained  by  replacing  in  Eqs.  (10),  (1 1),  and  (12) 
\l/(t)  with  \l/s(t). 

Let  us  consider  now  the  Gibbs  ensemble  of  composite 
systems  5*  +  P,  and  evaluate  the  average  (^sO))sp-  Note  that 


the  average  is  over  the  separate  statistics  of  the  two  systems  S 
and  P 


&(0> 


SP 


(14) 


We  select  all  the  responses  to  the  same  perturbation,  charac- 
terized by  a  given  §^(0,  we  evaluate  their  average,  denoted  by 
{^s(0)s^  and  finally  we  construct  the  average  over  all  possible 
perturbations  denoted  by  {...)p  so  as  to  obtain  the  final  result 
denoted  by  (...)    .  In  conclusion  with  this  procedure  we  obtain 


{cr(t))  =  mm 


Jo 


x(t,mp(t')),  (15) 


where,  for  notational  convenience,  we  drop  the  subscripts,  but 
we  understand  the  averages  in  the  sense  described  above. 

If  necessary,  the  signal  ^p(t)  must  share  the  same  properties 
as  ^sO)  and  for  simplicity  they  are  both  assumed  to  be  di- 
chotomous signals  with  random  renewal  fluctuations  between 
the  values  +1  and  —1.  ^p(t)  is  therefore  a  non-Poissonian 
dichotomic  fluctuation  with  the  following  waiting- time  PDF: 


ifp(t)  =  (ixp  -  1)- 


(16) 


it  +  Tpyp 

It  is  therefore  convenient  to  define  the  additional  functions 
\l/p(t,t'),  Rp(t),  and  ^p(t,t')  obtained,  analogously  as  for  5, 
from  Eqs.  (10),  (11),  and  (12)  after  replacing  i/^(0  with  the 
waiting-time  distribution  \l/p(t). 

The  spectrum  of  this  type  of  fluctuating  signal,  in  the 
absence  of  perturbation,  as  calculated  in  Refs.  [32,34],  is 


S(f)  oc  L^-^f^- 


(17) 


valid  for  /x  <  2,  remarkably,  even  though  a  stationary  auto- 
correlation function  cannot  be  defined  in  this  case.  In  the  case 
/X  >  2,  S(f)  =  A/f^-f^,  with  A  independent  of  L,  the  length 
of  the  sequence  under  study. 

At  this  point  it  should  be  clear  to  the  reader  that  to  get 
the  important  results  of  this  paper  on  the  transmission  of  the 
statistical  properties  of  P  to  5*  we  must  use  Eq.  (15).  This 
leads  us  to  give  a  prescription  to  define  {^p(t)).  We  assume 
that  the  perturbation  P  is  prepared  at  ^  =  0  at  the  beginning  of 
a  laminar  phase  (i.e.,  with  the  analogous  prescription  adopted 
for  the  system  S).  This  assumption  allows  us  to  replace  {^p{t')) 
with^p(rO  in  Eq.  (15). 

III.  PHENOMENOLOGICAL  APPROACH 

In  this  section  we  study  the  response  of  a  complex  system 
producing  non-Poissonian  renewal  fluctuations  to  a  perturbing 
signal  with  similar  properties  within  the  phenomenological 
approach.  We  analyze  both  the  average  response  and  the 
input-output  correlation  [i.e.,  the  correlation  between  the 
perturbing  fluctuating  signal  (input)  and  the  signal  produced 
by  the  system  of  interest  S  (output)] .  In  the  phenomenological 
approach,  the  waiting  times  between  the  events  generating  the 
dichotomic  fluctuations  remain  unchanged  by  the  perturbation. 
The  external  perturbation  introduces  a  bias  so  that  when 
an  event  occurs  the  probability  that  the  dichotomic  variable 
changes  or  keeps  its  value  is  slightly  different.  The  function 
X(^,^)  in  this  approach  is  given  by  [15,16] 

Xit,t')  =  ^^^^  =  Rs(t')^sit  -  t').  (18) 
dt' 
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The  function  ^s(tJ^)  is  the  autocorrelation  function  of  ^s(0^ 
namely,  the  survival  probability  of  age  t\  and  Rs(t)  for  the 
case  of  the  discrete  signals  considered  here,  is  the  rate  at  which 
events  are  produced  by  S  prepared  at  ^  =  0  [i.e.,  the  bits  per 
second  encoded  in  ^s(t)]-  This  rate  is  time  independent  only 
in  the  Poisson  case.  In  the  non-Poisson  case  it  depends  on 
time,  thereby  making  ^s(t,t^)  nonstationary.  The  brand  new 
survival  probability  ^5 (0  =  ^s(tj'=0)is  given  by  [15,16,18] 

^,(0  =  (l+^/^,)l-^^  (19) 

from  which  the  corresponding  waiting-times  PDF  \l/s(t)  = 
is  derived.  In  the  range  of  parameters  1  <  /x^  <  3 
considered  here,  it  is  known  [10]  that 

Rsit)^-'^^{Ts/tf-'^^,    for  2,  (20) 

Rs(t)-  -[l  +  (Ts/tr'-\    for2</x,  <3,  (21) 

with  Ts  =  Ts/iiJis  —  2)  the  mean  value  of  if  sit)- 

When  fis  <  ^  the  experimental  preparation  of  S  induces 
a  sequence  of  events  whose  rate  Rs  tends  to  vanish  for  t 
CO,  yielding  a  perennial  out-of-equilibrium  condition,  and  an 
explanation  of  the  death  of  linear  response  [18-24,28]  as  well. 
In  fact,  the  response  to  a  harmonic  perturbation  of  frequency 
/  is  proportional  to  l/(ft)^~^^  [18].  In  the  case  2  <  /x^  <  3, 
on  the  contrary,  the  preparation-induced  cascade  of  events,  in 
the  limit  t  ^  oo,  becomes  stationary  and  virtually  identical  to 
that  of  a  Poisson  process.  The  theoretical  analysis  of  this  paper 
is  done  in  the  asymptotic  time  regime.  Thus,  we  refer  to  the 
case  2  <  /X  <  3  as  stationary,  in  contrast  to  the  nonstationary 
case  /X  ^  2  of  perennial  transition.  Similarly  to  the  rate  of 
events  Rs(t)  the  spectral  intensity  per  unit  time  tends  to  vanish 
for  /X  <  2  as  an  effect  of  increasing  L  [see  Eq.  (17)].  The 
ideal  1  //-noise  condition,  corresponding  to  /x  =  2,  generates 
instead  a  logarithmic  decrease  of  the  spectral  intensity  with 
time,  and  consequently  a  spectrum  virtually  independent  of  L. 

A.  Average  response  to  perturbation 

As  previously  mentioned,  the  nonstationary  LRT  (NSLRT) 
of  Eq.  (3)  rests  on  the  preparation  of  S  at  time  ^  =  0.  We  apply 
the  same  preparation  condition  to  P,  thereby  generating  the 
cascades  Rs(t)  and  Rp(t)  described  by  Eqs.  (20)  and  (21),  with 
the  appropriate  indexing.  Under  this  condition  the  relaxation 
of  (§p(0)  becomes  identical  to  the  survival  probability  ^p(t). 
Assuming  the  condition  of  Eq.  (18)  we  have  the  following 
expression  for  the  average  response 

(a(0)  =  s  [  Rs(t')^s(t  -  t')^p{t')dt\  (22) 
Jo 

The  preparation  of  both  S  and  P  makes  the  average  over  many 
realizations  of  the  response  a(0  to  a  given  stimulus  P  vanish 
for  t  CO. 

Stationary  case:  2  <  /x^  <  3. 

In  this  regime  a  finite  time  scale  for  the  fluctuation  exists 
and  Rs(t)  reaches  the  constant  value  l/r^.  The  inverse  power- 


law  relaxation  of  ^s(0  allows  us  to  approximate  Eq.  (22)  by 
replacing  Rs(t^)  with  its  value  for  t^  ^  t,  that  is, 

(a(0>  -  €  [  dt'^sit  -  t')^p{t')lTs,  (23) 
Jo 

which  becomes  exact  for  ^  co.  The  asymptotic  behavior  of 
Eq.  (23)  is  easily  obtained  in  the  Laplace  domain 

1  1  -  his)  1  -  h(s) 
(a(s))  :^  €  ,  (24) 

Ts  S  S 

which  can  be  studied  in  the  limit  of  small  s.  In  fact,  since  [35] 
hs)^l-Ts  ^r(fi-2)sf'-\    /x>2,  (25) 

and 

ir(s)^l  +  r(ii-2)sf^-\    /x<2,  (26) 

it  follows  that  for  1  <  /xp  <  2  and  2  <  /xp  <  /x^,  the  time- 
asymptotic  behavior  is  (cr(t))  ~  which  is  proportional 
to  {^p(t))  for  large  t,  meaning  that  the  system  S  "inherits"  the 
relaxation  properties  of  the  perturbation  P . 

For  2  <  /x^  <  /xp,  instead,  the  asymptotic  dominant  term 
is  (cr(t))  ~  t^~^^,  which  is  proportional  to  the  ordinary 
unperturbed  relaxation  to  equilibrium  (^siO)^  when  an  initial 
bias  for  ^siO  is  introduced.  We  see  therefore  that  for  /xp  <  /x^, 
when  ^p(t)  is  slower  than  §^(0,  the  perturbation  P  imposes 
on  the  system  S  its  own  relaxation  properties  thereby  allowing 
one  to  "manage"  the  complexity  of  a  system  or  network  by 
using  an  appropriate  stimulus. 

Nonstationary  case:  1  <  /x^  ^  2. 

In  this  regime,  the  system  S  violates  the  finite-time 
scale  condition  necessary  for  stationary  dynamics  and,  in 
fact,  Rs(t)  oc  t^^~^,  see  Eq.  (20).  With  such  replacement  in 
Eq.  (22),  a  convolution  form  appears  which  can  easily  studied 
via  a  Laplace  transformation.  In  the  Laplace  domain  (see 
Appendix  A  for  details  on  coefficients) 

(a(s  2^  0))  2^  as  s^'-^  +  ap  s^'-^,  (27) 

which  implies  that  if /xp  >  2orif  1  <  /x^  <  fip  then  {a (t))  ~ 
When  1  <  /xp  <  /x^,  we  have  (cr(t))     r^"^^.  Also  in 
this  case  the  perturbation  P  forces  onto  S  its  own  relaxation 
properties  to  equilibrium. 

B.  Input-output  correlation  function 

We  study  the  cross-correlation  (or  input-output  correlation) 
function  between  the  system  S  and  the  stimulus  P  :  C(t)  = 
{{^s  (O^p  (0>).  which  is  also  used  as  an  indicator  of  aperiodic 
stochastic  resonance  [36].  Multiplying  both  sides  of  Eq.  (3)  by 
^p(t)  and  averaging  over  the  fluctuations  of  the  perturbation 
P  we  obtain 

<t>(t)^C(t)/s=  [  dt'Rs{t')^s{t-t')^p{t,t').  (28) 
Jo 

Note  that  both  Eqs.  (22)  and  (28)  depend  on  the  survival 
probability  of  P,  but  in  the  former  such  survival  probability 
depends  on  the  single  time  t'  whereas  in  the  latter  it  depends 
on  both  t'  and  t.  We  limit  ourselves  to  report  the  results  for 
the  asymptotic  value  Ooo  of  0(0-  When  ^sit)  and  ^p{t)  are 
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not  stationary,  that  is,  when  1  <  /x^  ^  2  and  1  <  iip  ^2, 
Eq.  (28),  in  the  Hmit  ^  ^  oo,  gives 

^oo  =  ^{l^s.l^p)  =  r(/X5  -\-  fip  -2) 

3F2  [{f^p  -  l,l^p  -  l,l^p  +  l^s-  2},{fip,fip},l] 


r(2  -  fip)r(fip)^r(fis  -  i) 


(29) 


where  3F2  is  the  generaUzed  hypergeometric  function.  For 
more  details  see  Appendix  A.  In  the  case  2  <  /xp  <  3,  Ooo  is 
simply  zero. 

In  the  case  2  <  /x^  <  3,  inserting  into  Eq.  (28)  expression 
(21)  for  Rs(t),  leads  to 


0(0 


Jo 


^s(t)^p(t,0) 


^p(t/)  =  ^s(0)^l^p(t,t) 


f  dt' 
Jo 


,  d^p(t,t') 


(30) 


where  ^^^(0  is  given  by  Eq.  (19)  after  replacing  fis  with 
fjis  —  I.  Equation  (30)  is  exact  for  t  ^  Ts  and  for  1  <  /xp  ^  2 
it  leads  to  Ooo  =  1  since  both  the  second  and  third  terms 
disappear  for  ^  ^  00  and  the  first  is  trivially  1  in  the  same 
limit.  For  Eq.  (30)  2  <  /xp  <  3  it  yields 


Ooo  =  1  -  (/xp  -  2)T^'~^T^'~^AT'^-^'-^' 


xB 


AT/Tp        ,  A  ^-f^P 

_Ar/r,'^^  +  ^^-^'3-/x, 

lis -2 


AT^O 


lip  +  lis- ^ 


(31) 


where  AT  =  \Ts  —  Tp\,  B[x,a,b]  is  the  incomplete  Beta 
function  and  the  upper  (lower)  choice  of  the  parameters  refers 
to  the  case  Ts  >  (<)Tp.  The  final  expression  in  Eq.  (31) 
corresponds  to  the  case  Ts  =  Tp.  The  results  are  summarized 
in  Table  I.  For  illustrative  purposes,  we  supplement  Table  I  with 
Fig.  1,  showing  the  three-dimensional  (3D)  plot  of  the  cross- 
correlation  function  O^o  in  the  same  parameter  range:  Square 
II  and  square  III  correspond  to  the  condition  of  minimal  and 
maximal  correlation,  respectively.  Intuitively  it  is  so  because  of 
the  difference  of  time  scales  between  S  and  P  in  such  regions. 
In  III  fluctuations  ^sO)  and  ^p(t)  have  a  finite  and  an  infinite 
time  scale,  respectively,  thereby  allowing  ^sO)  to  adapt  to  the 
stimulus-induced  bias  so  as  to  yield  maximal  correlation.  In 
II  the  role  of  the  time  scales  is  inverted,  the  bias  induced  by 
P  on  the  longer  (diverging)  time  scale  of  the  process  ^s(0 
is  asymptotically  averaged  out  due  to  the  many  intervening 
switching  events  of  ^p(t),  producing  no  correlation.  The 
vertex  /x^  =  /xp  =  2,  representing  a  1 //-noise  generating 
system  under  the  stimulus  of  a  l//-noise  perturbation,  marks 

TABLE  L  Summary  of  the  asymptotic  values  of  the  cross- 
correlation  function  <^(t)  in  the  phenomenological  case. 


lisi  Mp^ 

1  <  /xp  ^  2 

2  <  /xp 

<  3 

1</X5^2 

2</xs<3 

Ooo  =  1  in 

^00  =  0 

II 
IV 

*See  Eq.  (29). 


FIG.  1.  (Color  online)  The  asymptotic  limit  of  0(0  is  displayed 
for  /X5,/xp  e]l,3[-  The  vertex  /xs  =  /ip  =  2  marks  the  transition  to  a 
condition  of  maximal  input-output  cross  correlation. 

the  abrupt  transition  from  vanishing  (square  II)  to  maximal 
correlation  (III). 

IV.  DYNAMICAL  APPROACH 

In  this  section  we  extend  the  analysis  of  the  previous  section 
to  the  dynamical  approach.  Within  such  approach  we  derive 
both  the  average  response  and  the  input-output  correlation 
function. 

A.  Average  response  to  perturbation 

Starting  from  the  property 

{cr(t))  =  (32) 

obtained  by  averaging  over  the  fluctuations  of  both  S  and  P 
and  using  the  dynamical  condition  of  Eq.  (9),  one  obtains 


(33) 


(34) 


X(tj')  =  --^s(t,t')  =  xlfs(t,n 
dt 

Equation  (3)  then  becomes 

(a(t))=€  f  dt'xlfs(t/)(^p(t')). 
Jo 

For  simplicity  we  prepare  the  perturbation  ^p(t)  3.1 1  =  0. 
To  observe  the  influence  of  P  on  5*  we  select  all  the  systems 
of  the  Gibbs  ensemble  so  that  §^(0)  =  1.  In  this  case  (^p(t')) 
is  given  by  the  survival  probability  ^p(t)  [33].  Thus,  Eq.  (34) 
yields 


(a(0>  =  €  [  dt'xlfs(t/)^p(t\ 
Jo 


(35) 


where  ^p(t^)  is  the  survival  probability  for  the  process  ^p(t) 


^P(t)  =  ^ 


dx\l/p(x)  = 


Tp 


t  +  Tp 


jXp-l 


(36) 


This  slow  decay  corresponds  to  the  probability  that  no 
perturbation  event  occurs  up  to  time  t.  The  system  S  evolves 
in  time  so  as  to  reach  a  steady  value  that  corresponds  to 
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a  constant  perturbation  abruptly  applied  at  ^  =  0.  However, 
during  this  process  a  perturbation  event  occurs  that  has  the 
effect  of  suddenly  changing  the  external  field.  Thus,  a(t)  does 
not  reach  a  steady  value,  but  after  reaching  a  maximum  value 
will  decay.  Under  the  specific  conditions  discussed  in  this 
section,  the  time  asymptotic  decay  of  {cr(t))  in  Eq.  (35)  has 
the  same  power-law  index  as  that  of  survival  probability  ^p(t). 
We  interpret  this  phenomenon  as  the  transmission  into  S  of  the 
statistics  of  P. 

We  explore  again  the  whole  range  of  parameters  I  <  fis  < 
3  and  1  <  /xp  <  3,  respectively,  for  the  system  S  and  the 
perturbation  P,  depending  on  the  values  of  the  power-law 
indexes  fis,  f^p  characterizing  their  waiting-times  PDFs.  The 
value  11  =  2  marks  the  transition  from  a  finite  to  an  infinite 
mean  time  (i.e.,  the  transition  to  a  nonergodic,  nonstationary 
condition.  In  fact,  while  for  /x  >  2  a  mean  time  exists,  a  finite 
time  scale  can  be  defined  and  a  stationary  condition  is  reached, 
for  /X  <  2  such  condition  is  never  achieved,  not  even  in  the 
infinite  mean  time. 

Stationary  case:  2  <  /x^  <  3. 

In  this  regime  the  waiting-times  PDF  ^Ifsit)  has  a  finite 
mean  value  r^.  Therefore  a  finite  time  scale  tc  exists  such  that 
for  t  >  t'  >  tc  the  following  approximation  corresponding  to 
reaching  a  stationary  condition,  is  valid  [33] 

^s{t  -  t') 


ifsit.t 


dx\l/s(x  —  t')  = 


(37) 


where  Xs  =  Ts/iiis  —  2)  is  the  mean  value  of  \l/s(t).  Equa- 
tion (37)  is  exact  for  ^  oo.  We  can  also  use  Eq.  (3)  for 
the  response  in  the  case  when  the  interaction  is  turned  on  at 
a  later  time  tc  that  we  assume  to  be  so  large  as  to  satisfy 
the  approximation  Eq.  (37)  using  the  following  procedure. 
We  introduce  into  Eq.  (3)  an  effective  perturbation  which  is 
turned  on  at  time  ^  =  0  but  is  zero  until  tc  [i.e.,  ^p^(t)  = 
0(t  —  tc)^p(t  —  tc)].  In  this  scheme  we  can  assume  that  the 
"real"  process  ^p(t)  is  prepared  at  time  t  =  tc  ina.  brand  new 
condition.  Time  tc  therefore  corresponds  to  the  age  of  the 
system  S.  The  average  response  {cr(t))  of  S  in  this  case  reads 


{cr(t)) 


=  s  [  dt'fs{t,t')m'  -  tc)^pit'  -  tc)) 

Jo 

=  8  [  dt'ifs{t,t')^p{t'  -  tc\ 

Jtr 


(38) 


J 


and  the  approximation  (37)  can  be  used  to  replace  \l/s(t,t^). 
With  the  substitution  r  =  t'  —  tc  and  after  renaming  t  —  tc 
t  back  again,  the  average  response  of  the  system  S  of  age  tc 
reads 


(^(0> , 


Jo 


^s(t  -r) 
dz—  ^^p(r)  =  ^5(0)^p(0 

^s(t)^p(0) 


f  dt'^^ 
Jo 


s(t-t')--^p(t'), 
dt' 


where  ^^(0  is  defined  as 


^s(t)  =  {l+t/TsY 


(39) 


(40) 


Equation  (39)  is  exact  for  tc  ^  oo  and  coincides  with  the 
expression  (23)  obtained  in  the  phenomenological  case  in  the 
same  regime.  Therefore,  in  the  limit  t  ^  Ts,Tp,  the  same 
considerations  apply,  that  is,  if  1  <  fip  <  2  and  2  <  fis  <  l^p, 
the  time- asymptotic  behavior  is  {cr(0)  ~  t^~^^ ,  which  is 
proportional  to  {^p(t))  for  large  t,  therefore  the  system  S 
always  "inherits"  the  relaxation  properties  of  the  perturbation. 
If  2  <  /x^  <  /xp,  the  asymptotic  dominant  term  is  (cr(t))  ~ 
t^-^s^  which  is  proportional  to  the  ordinary  unperturbed 
relaxation  to  equilibrium  {^s(0)- 

Nonstationary  case:  1  <  /x^  <  2. 

Let  us  make  the  assumption  that,  although  at  time  ^  =  0 
half  of  the  S  elements  of  the  Gibbs  ensemble  are  in  the 
state  §^  =  +1  and  half  in  the  state  §^  =  —  1,  all  of  them 
are  at  the  beginning  of  their  sojourn  in  the  corresponding 
states.  This  is  an  out-of-equilibrium  condition,  corresponding 
to  preparing  the  system  at  ^  =  0.  The  calculations  are  detailed 
in  Appendix  B.  Using  the  dynamic  theory  we  obtain 


(a(0)  ^,  ki(^s,f^p)  _^  hil^s.l^p) 


where  the  first  coefficient  is  given  by 

Tp^'-'  sm(7tfis)r(2  -  fis)r(l  -fip+  fis) 
ki(fis,f^p)  =  —  ^  

and  the  second  coefficient  is  determined  to  be 


(41) 


(42) 


hif^s^f^p) 


[sin(7r/X5)r(3  -  /X5)r(l  -  /xp  +  /X5)r(2/X5  -  3)  -  (2  -  /X5)r(2/X5  -  I^p  -  1)] 


Tp'-f^'Ts^'-^niixp 


2)(2  -  /X5)r(3  -  iis)r(fis  -  f^p)r(2iis  -  3) 


(43) 


In  this  range  of  parameters  the  dominant  term  is  always  the 
first  term  in  Eq.  (41)  which,  if  /xp  <  /x^,  is  also  slower  than 
the  unperturbed  relaxation  to  equilibrium  of  §5.  In  the  latter 
range,  therefore,  the  system  S  relaxes  to  equilibrium  inheriting 
the  same  properties  of  the  perturbation  P. 

The  result  in  Eq.  (41)  is  of  special  interest  since  it  discrimi- 
nates between  the  two  approaches  in  the  nonstationary  regime. 
It  is  this  difference  that  allowed  to  determine  that  liquid  crystals 
[13]  follow  the  prediction  of  the  dynamical  approach.  In  fact. 


the  phenomenological  approach  disregards  the  influence  of  the 
perturbation  on  the  occurrence  time  of  the  S  events  [19,28], 
while  the  dynamical  theory  does  not,  thereby  affording  a 
criterion  for  information  transport  that  we  judge  to  be  a  more 
appropriate  representation  of  the  communication  among  com- 
plex systems  with  /x  <  2.  However,  the  equivalence  between 
the  phenomenological  and  the  dynamic  theories  in  the  case 
when  the  system  S  is  infinitely  aged  (i.e.,  for  fis  >  2)  indicates 
that  the  generalization  of  FDT  given  by  Eq.  (9),  namely,  the 
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dynamical  theory,  becomes  active  only  when  the  system  S 
is  in  a  far  from  equilibrium  condition  and  begins  drifting 
toward  equilibrium.  Although  equilibrium  is  never  reached 
when  fi  <  2,  the  correlation  function  ^fsOJ^)  tends  to  recover 
the  property  ^5 (^,^0  =  ^^(^  —  ^0  that  makes  the  phenomeno- 
logical  theory  formally  equivalent  to  the  dynamical  theory. 

B.  Input-output  correlation  function 

Herein  we  study  the  asymptotic  limit  of  the  input-output 
correlation  function 


Ooo  =  lim  C(t)/€, 


(44) 


within  the  dynamical  approach.  The  input-output  correlation 
function  is  again  defined  by  the  average  over  the  fluctuations 
in  both  the  system  S  and  perturbation  P 


C(t)^{^s(mp(t)}. 


(45) 


The  asymptotic  limit  of  C(0  is  independent  of  the  way  the 
system  and  the  perturbation  are  prepared,  so  we  can  use  the 
prescription  leading  to  Eq.  (3)  obtained  assuming  that  both  S 
and  P  are  prepared  at  time  ^  =  0.  We  therefore  use  the  same 
arguments  as  those  yielding  Eq.  (28)  and,  adapting  them  to  the 
dynamic  theory,  we  obtain 


0(0  =  C(t)/€  =  [  dt'fs(t,t')^p(t,t'). 
Jo 

Nonstationary  case  I:  ^jls  ^'^^  l^p 


(46) 


We  use  Eq.  (46)  with  the  general  expressions  for  yjrsit.t') 
and  ^p{t,t')  as  obtained  through  Eqs.  (10)  and  (12),  re- 
spectively. In  this  case,  taking  the  limit  t  ^  oo  yields  (see 
Appendix  B  for  details) 

^.         ^        sinjr/xp  r(/xp  + /x^  -  1) 


lim  0(0 

t—^oo 


71    (fip  -  iwifip  +  i)r(fis  -  1) 

X  F  [{fip  -  l,fip  -  l,fip  +  lis  -  l},{/xp,/xp  +  1},1] . 

(47) 


Nonstationary  case  II:  /x^  <  2,  /xp  >  2. 


In  this  case  we  can  assume  that,  when  the  interaction  is 
turned  on,  the  perturbation  P  has  already  reached  a  stationary 
condition,  so  that  in  Eq.  (46)  \l/s(t,t^)  is  given  again  by  Eq.  (10) 
but  ^p(ty)  =  "^pit  -  t'),  with 


^p{t)  =  {\+t/TpY 


(48) 


and  this  expression  has  to  be  directly  inserted  into  Eq.  (46). 

The  power-law  index  fip  —2  reflect  the  stationary  condi- 
tion realized  with  the  preparation  of  the  perturbation  P  at  a 
time  tp  =  —00.  In  this  case,  we  obtain 


lim  0(0  =  0. 


(49) 


In  the  time  asymptotic  limit,  the  system  S  turns  out  to  be 
independent  of  P  in  spite  of  the  fact  that  at  ^  =  0  we  switch 
on  the  S-P  interaction. 

Stationary  case  I:  iis  >     l^p  > 

We  again  use  Eq.  (46)  and  assume  that,  when  the  interaction 
is  turned  on,  the  perturbation  P  has  already  reached  a 
stationary  condition  so  that  in  Eq.  (46)  ^p(t,t')  =  'ifp(t-  t') 


with  ^p{t)  given  by  Eq.  (48).  For  /x^  >  2  a  finite  mean  time 
Tg  of  V^(0  exists,  therefore  a  finite  time  scale  tc  oc  xg  exists 
such  that  for  t  >  tc  Eq.  (37)  can  be  used  again  to  approximate 
\l/s(t,t^).  With  such  substitutions,  the  expression  for 
the  correlation  function  becomes  asymptotically  equal  to  the 
correlation  obtained  in  the  phenomenological  approach  in  the 
same  regime  [i.e.,  Eq.  (30),  leading  to  the  final  result  Eq.  (31)]. 

Stationary  case  II:  iig  >     l^p  <  ^^ 

In  this  case  again  a  finite  time  scale  tc  can  be  found  such 
that  the  approximation  in  Eq.  (37)  is  valid.  Thus,  again  Eq. 
(46)  can  be  rewritten 


0(0 


Jo 


dt'irs{t,t')^p{t,t') 


+ 


J,c  Vdt' 


^sit  - 1') 


(50) 


This  case  is  therefore  equivalent  to  what  is  obtained  for 
the  phenomenological  case  in  the  same  range  of  parameters. 
Again,  in  the  asymptotic  limit  t  oo,  the  first  term  in  Eq.  (50) 
vanishes  and  after  integrating  the  second  term  by  parts,  one 
obtains 


<D(f)  ~  >I's(0)*/.(?,0  -  ^sit  -  tc)^p(t,tc) 


dt'^sit-t')  2  ■ 

at' 


(51) 


In  the  asymptotic  limit  the  second  term  on  the  right-hand 
side  trivially  vanishes  and  also  the  third  term  side  can  be  shown 
to  vanish  (see  Appendix  B).  The  only  remaining  contribution 
in  Eq.  (51)  is  given  by  the  first  term,  which  is  exactly  one.  It 
follows 


Ooo  =  lim  C(OA  =  1. 


(52) 


Let  us  make  here  a  few  remarks  on  these  results.  Of  course 
in  the  absence  of  coupling,  €  =  0,  which  means  that  C{t) 
always  vanishes.  We  switch  the  interaction  on  at  ^  =  0.  Thus, 
we  always  have  the  zero  cross-correlation  initial  condition 
C(0)  =  0. 

As  an  effect  of  switching  on  the  interaction  at  ^  =  0  we 
realize  the  condition  C{t)  ^  0,  for  ^  >  0.  However,  we  find 
that  there  exist  special  conditions  (region  II  in  Fig.  2)  for 
which  the  cross  correlation  again  goes  to  zero,  asymptotically 


Ooo  =  lim  C(0/€  =  0. 


(53) 


This  indicates  that  only  in  such  conditions,  the  system  S,  after 
a  transient,  recovers  the  condition  of  statistical  independence 
of  the  perturbation  §p(0.  The  fluctuations  with  /x^  <  2  turn 
out  to  be  statistically  independent  of  ^ pit)  only  when  /xp  >  2. 
The  environmental  perturbation  with  [ip  <  2,  on  the  other 
hand,  is  characterized  by  the  remarkable  property  of  forcing 
the  system  S  to  respond,  regardless  of  the  value  of  /x^. 

Table  II  and  the  plot  in  Fig.  2  summarize  such  results 
and  show  the  same  qualitative  properties  observed  for  the 
correlation  function  in  the  phenomenological  case,  with  the 
correlation  functions  for  the  two  approaches  being  identical 
in  squares  I,  II,  and  IV.  The  condition  of  ideal  1//  noise  (i.e., 

=  fip  =  2)  marks  the  transition  from  a  condition  of  zero 
to  maximal  correlation. 
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TABLE  11.  Summary  of  the  asymptotic  values  of  the  cross- 
correlation  function  <^(t)  in  the  dynamical  case. 


In  the  same  long-time  scale,  Eq.  (57)  yields 


1  <  /Xp  ^  2 

2  <  fip 

<  3 

2</X5<3 

^oo  =  1  III 

^00=0 

II 
IV 

*See  Eq.  (47). 

V.  FINITE  RESPONSE  AND  1//  RESONANCE 

Herein  we  proceed  to  demonstrate  that  the  intensity  of 
the  response  a(0  to  a  single  realization  of  the  stimulus  does 
not  decay  if  Ooo  ^  0.  This  demonstration,  therefore,  is  valid 
for  both  the  dynamical  and  the  phenomenological  approach, 
leading  to  a  general  result.  Let  us  define  with  pti^s^  the 
probability  that  at  time  t  the  variable  takes  the  value 
i  =  ±1  and  with  /?^(§^|§p)  the  conditional  probability  for 
the  occurrence,  at  time  ^,  of  a  value  =  /  =  ±1,  given 
the  occurrence  of  a  value  §p  =  j  =  ±1.  By  definition,  the 
nonvanishing  Ooo  yields 

C(t)  ^  J2      P^i^sl^i)  P^i^i)  ^  ^^oo.  (54) 

i,j=±i 

We  note  that  for  ^  ^  oo,  on  a  time  scale  such  that  {^p(t)), 
which  decays  as  (^p(0))t^~^^ ,  is  a  second-order  quantity 
0(£^)we  have  that 


and 


0(0  =  Ooo  +  0(s\ 


(55) 


(56) 


due  to  Eq.  (54)  and  to  the  symmetry  of  the  considered 
mous  Drocesses 


Thus  is 

dichotomous  processes 


I  js- 


(57) 


FIG.  2.  (Color  online)  The  asymptotic  limit  of  <^(t)  is  displayed 
for  the  different  regimes  of  parameters  for  1  <  /xs  <  3  and  1  <  jjip  < 
3,  in  the  dynamical  approach. 


{cf{t))-_ 


(58) 


where  the  subscript  ib  indicates  the  value  of  at  time  t. 
Summing  Eq.  (58)  over  the  two  values  of  §p  gives  a  total 
average  null  response,  as  expected.  But  if  the  magnitude  \a{t)\ 
of  the  response  to  a  single  instance  of  the  input  ^p{t)  is 
considered  instead,  its  total  average  is 

(i^wi)  =  \  Y.^wm±  >  ^  E  i{^(o>±i  -  ^^oo,  (59) 

where  an  equality  holds  if  terms  of  order  0{s^)  are  neglected. 
Thus  when  Ooo  >  0,  the  response  ^(O  to  a  single  instance  of 
the  input  ^p{t)  does  not  die  out  and  remains  proportional  to 
the  stimulus  intensity,  no  matter  how  large  t  becomes.  Square 
III  in  both  Figs.  1  and  2  is  the  plateau  region  of  maximal  cross 
correlation  and  response.  Claims  regarding  the  death  of  linear 
response,  as  in  Refs.  [19-28],  are  therefore  appropriate  only 
in  relation  to  the  vanishing  correlation  of  square  II.  The  total 
average  response  {cf(t))  always  tends  to  vanish  for  ^  ^  oo  for 
reasons  that  do  not  imply  a  lack  of  response  except  in  the  case 
of  square  II. 

The  reason  for  the  striking  difference  between  the  response 
to  a  harmonic  perturbation  and  the  response  to  a  nonergodic 
stimulus  is  intimately  related  to  the  emergence  of  1  //  noise 
and  to  its  spectrum  described  by  Eq.  (17),  which  assigns  the 
weight  S(f)/L  =  l/(/L)^~^^  to  the  spectral  component  of 
frequency  /  of  a  nonergodic  stimulus.  As  a  consequence,  the 
stimulus  generates,  in  time,  lower  and  lower  frequencies  /  so 
as  to  keep  1  / (fL)^~^^  (i.e.,  the  response  intensity  to  frequency 
/  [18])  finite,  thereby  yielding  Eq.  (59).  The  death  of  linear 
response  [19-28]  is  caused  by  the  fact  that  stimuli  with  fixed 
frequencies  cannot  cope  with  the  decreasing  frequency  of  the 
cascade  of  events  of  Eq.  (20). 

We  have  afforded  a  compelling  proof  that  the  intensity  of  the 
single  realizations  of  a(t),  with  fis  <  2,  does  not  decay  if  the 
perturbation  §P  (0  falls  in  the  same  complexity  basin  (/xp  <  2). 
This  is  the  phenomenon  of  complexity  management  which 
allows  to  define  the  right  stimulus  to  obtain  a  response  from 
a  system  or  network  producing  nonergodic  fluctuations.  Now 
we  argue  that  1/f  stimuli  generate  the  maximum  information 
transport  by  looking  at  the  mutual  information 

i(t)  =  ^p,(i;)p,(i^|§;)iog[/,,(§^|§;)/p,(§^)].  (60) 


Using  Eqs.  (55),  (56),  and  (57)  it  follows  that 
I(t  ^  00)  ^  s^<^l, 


(61) 


and  the  information  transmission  rate  is  obtained  by  multiply- 
ing 7(0  by  the  input  rate  [37],  given  by /?p(0.  If /xp  >  2,Fig.  1 
shows  that  Ooo  <  1.  Although  square  III  in  Figs.  1  and  2,  indi- 
cates that  all  stimuli  with  fip  ^  2  induce  maximal  correlation, 
jjip  <2  corresponds  to  a  stimulus  with  decaying  events  rate 
(input  bits/s)  Rp(t).  So  even  if  a  response  is  produced  in  this 
regime,  the  rate  of  information  vanishes  in  time. 

Only  at  the  crucial  condition  /xp  =  2,  of  ideal  l/f  noise, 
does  this  algebraic  decay  become  logarithmic,  and,  conse- 
quently, a  steady  and  maximal  information  transmission  rate  is 
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achieved.  This  is  the  phenomenon  that  we  call  1  //  resonance. 
The  above  considerations  are  valid  for  both  the  dynamical 
and  phenomenological  approaches,  therefore  we  consider  the 
condition  of  maximal  information  transmission  achieved  in 
the  ideal  1 //-noise  condition,  a  fundamental  property  of 
1// -noise  renewal  processes. 

VI.  CONCLUDING  REMARKS 

The  growing  interest  for  the  dynamics  of  complex  networks 
is  shifting  the  attention  of  the  researchers  from  the  synchro- 
nization of  two  stochastic  units  [38]  to  the  synchronization  of 
a  large  number  of  units  [39],  an  interesting  phenomenon  that 
is  closely  related  to  the  very  popular  model  of  Kuramoto  [40]. 
The  single  units  of  the  processes  of  chaos  synchronization  are 
chaotic  and  do  surprisingly  synchronize  while  maintaining 
the  erratic  dynamics  that  they  have  in  isolation.  Although 
the  single  units  of  the  Kuramoto  model  are  regular,  it  is 
becoming  increasingly  evident  that  the  emergence  of  a  global 
synchronization  is  a  condition  independent  of  whether  the 
single  units  are  regular  or  stochastic.  The  single  units  of  the 
work  of  Refs.  [1 1,41]  are  Poisson  processes  and  if  one  of  them 
drove  the  other,  they  would  obey  the  principle  of  aperiodic 
stochastic  resonance  [42].  If  the  two  units  are  bidirectionally 
coupled  they  are  expected  to  undergo  a  condition  of  perfect 
synchronization  if  the  coupling  is  sufficiently  intense.  When 
the  number  of  interacting  units  is  very  large  a  phase  transition 
occurs  from  the  noncooperative  to  the  cooperative  behavior 
[1 1,41].  It  is  important  to  stress  that  at  criticality  no  permanent 
consensus  is  reached,  and  the  mean  value  of  the  global  field 
vanishes.  Yet,  this  condition  is  strikingly  different  from  the 
noncooperative  condition.  The  whole  network  remains  in  the 
"yes"  ("^6>")  state  for  an  extended  time  before  making  a 
transition  to  the  "^(9"  Cyes'')  state. 

It  is  surprising  that  the  phase-transition  literature  seems  to 
have  overlooked,  with  only  a  few  exceptions  [11,41,43],  that 
the  transitions  from  the  "yes''  ("no'')  to  the  "no"  ("yes")  state 
occurring  at  criticality  are  the  "crucial"  events  defined  in  Sec.  I. 
In  other  words,  the  time  interval  between  two  consecutive 
transitions  is  derived  from  a  PDF  that  has  the  asymptotic  time 
structure  of  Eq.  (1)  with  a  power  index  fi  fitting  the  inequality 
condition  of  Eq.  (2).  Some  authors  [1 1,41]  argued  that  /x  =  1 .5 
and  others,  [7],  releasing  the  condition  that  all  the  units  share 
the  same  Poisson  rate,  generate  a  global  condition  with  crucial 
events  characterized  by  /x  <  2,  but  significantly  departing  from 
the  value  /x  =  1.5.  Note  that  the  theoretical  arguments  of 
Ref.  [44],  yield  the  misleading  impression  that  the  crucial 
value  of  /X  is  a  consequence  of  ordinary  statistical  physics. 

According  to  some  authors  [45,46]  the  Kuramoto  phe- 
nomenon can  be  defined  as  inner  synchronization.  A  network 
of  cooperating  units  located  on  the  nodes  of  a  complex  network 
may  reach  inner  synchronization  with  different  values  of  the 
control  parameter,  depending  on  the  network  topology  [47]. 
This  is  a  subject  of  increasing  interest  with  attractive  appli- 
cations to  the  dynamics  of  the  human  brain  [45].  If  we  adopt 
this  perspective,  we  can  address  the  problem  of  information 
transmission  from  one  to  another  complex  network  as  a  process 
of  outer  synchronization.  This  is  an  interesting  issue,  but  the 
conditions  to  fulfill  to  realize  outer  synchronization  are  not  yet 
clear  [45]. 


An  important  result  of  this  article  is  the  discovery  of  a 
promising  road  to  settle  the  problem  of  information  trans- 
mission from  one  to  another  complex  network.  In  fact,  if  the 
inner  synchronization  corresponds  to  a  criticality  condition 
and  criticality  generates  crucial  events  with  a  power-law  index 
/X  fulfilling  the  inequality  of  Eq.  (2),  then  a  complex  network 
at  criticality  is  a  generator  of  1  //  noise,  with  S(f)  a  1  /f^~^. 
Thus  the  problem  of  information  transmission  from  one  to 
another  complex  network,  in  this  regime,  becomes  equivalent 
to  the  phenomenon  of  1  //  resonance  illustrated  in  this  article. 
This  is  essentially  the  main  result  of  earlier  work  [29].  The 
main  conclusions  of  Ref.  [29]  are  illustrated  by  Fig.  1,  which 
is  obtained  using  the  phenomenological  LRT. 

What  are  the  limits  of  this  earlier  result?  The  experiments 
[12,13]  yield  support  to  the  dynamical  rather  than  the  phe- 
nomenological LRT,  thereby  generating  doubt  that  the  results 
of  Ref.  [27],  although  very  attractive,  may  not  completely 
reflect  reality.  It  is  important  to  stress  that  phenomenological 
LRT  is  a  natural  consequence  of  adopting  the  asymptotic 
time  perspective  replacing  the  waiting-times  PDF  i/^(r)  of 
Eq.  (1)  with  i/^(r)  a  1/r^.  This  way  of  proceeding,  although 
generating  the  elegant  mathematics  of  fractional  derivatives, 
has  as  an  ultimate  effect  the  misleading  discovery  of  the  death 
of  linear  response  [19-24,28].  We  do  not  adopt  the  asymptotic 
time  perspective  but  the  special  form  of  Eq.  (1).  This  is  not 
a  unique  way  of  connecting  the  long-time  to  the  short-time 
regime.  However,  whatever  form  we  adopt  we  are  convinced 
that  there  will  be  a  parameter  playing  the  same  microscope 
role  of  the  parameter  T  of  Eq.  (1).  It  is  reasonable  to  assume 
that  an  external  perturbation  may  perturb  either  T  or  /x,  or  both 
of  them.  However,  the  perturbation  of  /x  is  incompatible  with 
the  assumption  of  a  weak  stimulus.  In  fact,  /x  is  a  consequence 
of  the  cooperation  among  the  units  of  the  network,  and  a 
perturbation  may  affect  /x  only  if  its  strength  is  large  enough 
to  influence  the  interaction  among  the  units  of  the  network. 
Thus,  an  external  weak  perturbation  can  only  have  an  effect  on 
r,  thereby  making  the  dynamical  LRT  become  the  proper  way 
to  study  the  response  of  a  complex  network  to  a  weak  external 
stimulus,  in  accordance  with  the  experimental  results  [12,13]. 

For  these  reasons,  we  can  conclude  that  Fig.  2  is  the  original, 
and  important,  result  of  this  paper.  We  hope  that  it  may  open  the 
road  to  the  dynamical  solution  of  the  problem  of  information 
transmission  [48]  from  one  to  another  complex  network,  a 
research  topic  that  is  still  in  its  infancy. 
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APPENDIX  A 

In  this  Appendix  we  record  more  details  about  the  deriva- 
tion of  both  the  average  response  {a(t))  and  the  input-output 
correlation  function  0(0  in  the  case  of  the  phenomenological 
approach. 

When  we  adopt  the  phenomenological  theory,  we  obtain 
for  the  average  response  to  external  perturbation,  in  the 
nonstationary  case  [cf.  Eq.  (27)]  the  following  asymptotic 
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expression: 


where  the  coefficient  of  the  first  term  is 

,      r(fip  -  fis)      r(i  -  fip -\- fis) 

ki(Lls,ljip)  =  , 

r(i  -  iip)r(iis)    r(2  -  iip)r(iis) 

and  the  coefficient  of  the  second  term 

Ml^s.l^p)  =  —  — — r. 

r(l  -  iis)T{l^p) 

Note  the  logarithmic  corrections  corresponding  to  /x^ 
with 


sin(7r/X5)  log^  _^  A{iis) 


(Al) 
(A2) 

(A3) 

=  l^p, 

(A4) 


and 


sm(7tfis) 


2y  -  2\I/l(iis) 


cos(7r/X5), 
(A5) 


where  y  is  the  Euler's  constant  and  \I/l(z)  the  logarithm 
derivative  of  the  F  function.  These  predictions  are  qualitatively 
equivalent  to  the  dynamical  theory  predictions  with  the 
assumption  that  the  system  S  has  been  prepared  in  the  very 
distant  past. 

The  general  expression  of  the  correlation  function  in  the 
phenomenological  approach  is  given  by  Eq.  (28). 

Nonstationary  case  I:  fis  <         <  2. 

In  this  range  the  asymptotic  approximation  for  the  function 
R(t)  defined  in  Eq.  (1 1)  for  both  S  and  P  is 

sin TTu  „  9 
R(t)  2^   -t^-^.  (A6) 
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The  power-law  properties  of  ^s(0^  in  the  long-time  limit, 
shift  the  dominant  contribution  to  the  integral  of  Eq.  (28)  to 
the  range  t'  ^  t,  this  allows  to  adopt  the  approximation  (A6) 
inside  the  integral.  It  follows 
sin  TT /X  5  sin  TT/xp 

^yl )  — 

X  /  T^'-^^s{t  -T)dT  f  x^'-^^p(t  -x)dx. 
Jo  Jo 

(A7) 

Using  a  generalized  Newton  binomial  expansion  of  the  power- 
law  form  of  the  functions  ^p(t)  we  obtain  the  following 
expression: 

n     )  (t  +  TpY+f^p-^ 


0(0^ 


smjr/x^  sm  Ttfip 


TV 


TV 


oo  . 


f 

Jo 


rn+iip+iis-^ 


^s{t  -  t')dt\ 


(AS) 


while  the  convolution  with  ^^(0  in  the  integral,  leads  to 
^^^^  ^  sin  TT/x^  sin  TT/xp       /I  -  iip\  (-1)" 


TV 


(t  +  TpY+^'p-^ 


fU+jXp  —  l 


Tin  +  fip+fis-  2)r(2  -  i^s) 


n  +  fjLp  -  I 


r(n-\-  fip) 


(A9) 


where  we  have  used  the  fact  that  for  t  large  the  main 
contribution  to  the  integral  comes  from  the  range  of  values 
X  <  r  <  ^  and  therefore  the  asymptotic  approximations  for 
Rs(t)  an  Rp(t)  are  justified.  The  limit  for  ^  ^  oo  cancels 
the  time  dependence  and  leaves  a  sum  which  leads  to  the 
expression  of  Eq.  (47)  in  the  text. 

Nonstationary  case  II:  /x^  <  2,/xp  >  2. 

In  this  case  the  function  Rp(t)  tends  asymptotically  to  a 
constant  value  and  therefore  it  easy  to  see,  following  analogous 
procedure  to  the  previous  case,  that  the  correlation  tends  to 
zero. 

Stationary  case  I:  lis  >  2,/xp  <  2. 

As  mentioned  in  the  text,  from  Eq.  (30)  it  is  enough  to  show 
that  the  third  term  on  the  right-hand  side  is  zero.  Such  terms 
read  as 


sin  TT/xp  ~ 
 —  /  ^5 

Jo 


(t  -  t')dt't'^'-^^p{t  -  t').  (AlO) 


Using  a  generalized  binomial  expansion  for  both  ^s{t)  and 
^p{t)  we  get 


0(0^ 


sm  TT /Xp 


TT 


EC 

n,m  ^ 


lip 


m    J  (t  -\-  TpY+^^p-^ 


it  +  7^)^+^5-2 


/' 

JO 


(All) 


Carrying  out  the  integration  shows  that  the  leading  term  in  t 
vanishes  for  ^  ^  oo  Such  a  demonstration  is  valid  also  for  the 
dynamical  case  in  the  same  range  of  parameters  [see  Eqs.  (51) 
and  (52)]. 

APPENDIX  B 

This  Appendix  is  devoted  to  detailed  calculations  involved 
the  evaluation  of  the  asymptotic  limits  of  the  function  0(0 
and  {a{t))  in  the  dynamic  approach. 

Herein  only  the  parameter  range  /x^  <  2,/xp  <  2  is  consid- 
ered, the  other  ranges  being  analyzed  in  detail  in  the  main  text 
and  in  Appendix  A. 

0(0  is  given  by  the  following  integral: 


^0 


4>(0  =  /  irs{t,t')^p{t,t')dt' 


(Bl) 


where  if  sit, t')  is  defined  in  Eq.  (10)  and  in  Eq.  (12), 

each  with  the  appropriate  labeling.  <S>{t)  can  be  decomposed 
as  sum  of  four  contributions 


cl>(f)  =  ^cl,,.(0, 


i=l 


where 


(B2) 


(B3) 


4^2(0  =  irsit)  f  f  Rp{t")^p{t  -  t")dt"dt' 
Jo  Jo 

=  xlfsit)  [it-  t')Rp(t')^p(t  -  t')dt\  (B4) 
Jo 
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<I>3(0  =  ^p(t)  [   [  Rs(t"ns(t  -  t")dt"dt' 
Jo  Jo 

Jo 


$4(0 


(t  -  t")Rs{t")fs{t  -  t")dt",  (B5) 


=  111  Rs{t")fs{t-t")Rp{T)^p{t-x)dt'dt"dT. 
Jo  Jo  Jo 

(B6) 


Here  xlfsiO  and  ^p(t)  are  defined  in  Eqs.  (13)  and  (36).  Using 
the  methods  of  Ref.  [49],  Eqs.  (B4),  (B5)  can  be  evaluated  in 
the  asymptotic  Hmit  to  yield 


Oi(0 


$2(0  : 


Since  in  the  Hmit  ^  ^  oo  the  three  contribute  vanish,  it  is 
not  important  the  evaluation  of  the  constants  ci,  C2,  C3.  To 
evaluate  04(0  we  first  need  an  analytical  expression  for  Rs{t) 
and  Rp{t).  It  can  be  shown  that  [49] 


sin(7r/X5)  1 


with  the  same  expression,  with  the  respective  parameters  being 
valid  for  Rp(t).  Then  we  have 


rt' 


Rs(T)ifs(t  -  T)dT 


lo  TCit  +  Ts) 

On  the  other  hand 

ft' 

Rp(T)^fp(t  -  T)dT 
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Jo 


it  +  TpY^-^ 


E 

n=0 


1  -  111 


(t  +  TpY 


X  /  /?p(r)r"Jr. 
Jo 


(B8) 


Using  for  Rp(t)  the  same  approximation  of  Rs(t)  Eq.  (B2), 
we  have 


Jo 


/?p(r)^p(r  -  T)dT 


smn  lip 


7t{t  +  Tpyp-^ 


E 

n=0 


1  -         i-T  ^^"+^^-1 


n     /  (t  -\-Tpy  n-\-  fip  -1 
(B9) 


(B7)      The  function  04(0  is 


sin  TT/xp  sin  TT/x 5  ^M5+mp  1 
irHt  +  Tp)f^^-^  it  +  TsTs 


00  / 

n=0 


+  Tp 


1  -  iip\  F{n-\-  fip  -\-  fis-  l,f^s  -  l,n-\-  fip  -\-  fis,j^) 


04(0 


n     /  (n  +  lip  -  l)(n  +  fip  +  lis  -  I) 

where  F(a,b,c,z)  is  the  hypergeometric  function.  Finally 

smniipriiip  +11S-  l)(t  +  Tp^-^'t^' 


^4(0 


(1  -  iipWiiip  +  l)r(iis  -  l)(t  +  Ts^sti-^^P 
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{lip  -  l,llp  -  l,llp  +  lis  -  1},{/Xp,/Xp  +  1}, 


t^Ts 


r 


where  pFg({a},{b],z)  is  the  generalized  hypergeometric  func- 
tion. In  the  limit  for  ^  ^  00 

sinniiprdip  -\-  lis  -  I) 

~7t(iip-i)r(iip+i)r(iis-i) 

X  F  [{lip  -  l,iip-l,iip+iis  -  l},{/xp,/xp  +  l},l] . 

(BIO) 


As  far  as  the  evaluation  of  the  function  ((7(t))  is  concerned 
in  the  dynamic  approach  it  is  given  by  Eq.  (35) 


{cr(t))  =  €  f 
Jo 


fsit.t')^pit')dt' 


which  is  to  be  evaluated  in  the  parameter  range  iis  <  2,iip  < 
2.  Let  Ps(t)  =  Rs(t)  +  8(t),  Eq.  (Bll)  reads 


(crit)) 


Jo  Jo 


dxPs(x)\lfs(t  -  x),  (B12) 


which,  after  a  transformation  on  the  integration  domain,  turns 
into 


/' 

Jo 


{a(t))  =  €  /  dxxifsit  -  x)Ps(x) 


f  dt' 

J  X 


^p{t').  (B13) 


The  long-time  limit  shifts  the  main  weight  in  the  integral  on  the 
terms  such  that  x  <t.  Therefore  the  approximation  Eq.  (B7), 
valid  also  for  Psit)  in  the  long-time  limit,  can  be  adopted. 
(Bll)      Inserting  such  approximation  in  Eq.  (B 13)  leads  to 


{a(0)^-€^^^^  f  dxxifsit  -  x)x^^-^  f  dt'^pit'). 

7tT^'    ^  Jo  Jx 


(B14) 
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After  getting  rid  of  the  integral  in  t'  by  direct  integration 
of  ^pit'),  one  is  left  with  a  simple  convolution  which  is 
easy  to  analyze  in  Laplace  transform.  After  some  straight- 
forward algebra,  extracting  the  two  asymptotic  leading  terms, 


leads  to 

{G{t))  2^  kit^-^'  +  k2t^'-^'-\  (B15) 

that  is,  the  asymptotic  expression  of  Eq.  (41),  with  the 
coefficients  given  by  Eqs.  (42)  and  (43). 
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Application  to  Robot  Teams 

Control  of  an  autonomous  robot  team  with  the  aim  of  optimizing  network  performance 
is  discussed  in  this  paper;  the  authors  design  a  controller  to  ensure  availability 
of  communication  resources. 

By  Jonathan  Fink,  Student  Member  IEEE,  Alejandro  Ribeiro,  Member  IEEE,  and 
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ABSTRACT  |  In  this  paper,  a  system  architecture  to  provide 
end-to-end  network  connectivity  for  autonomous  teams  of 
robots  is  discussed.  The  core  of  the  proposed  system  is  a  cyber- 
physical  controller  whose  goal  is  to  ensure  network  connec- 
tivity as  robots  move  to  accomplish  their  assigned  tasks.  Due  to 
channel  quality  uncertainties  inherent  to  wireless  propagation, 
we  adopt  a  stochastic  model  where  achievable  rates  are 
modeled  as  random  variables.  The  cyber  component  of  the 
controller  determines  routing  variables  that  maximize  the 
probability  of  having  a  connected  network  for  given  positions. 
The  physical  component  determines  feasible  robot  trajectories 
that  are  restricted  to  safe  configurations  which  ensure  these 
probabilities  stay  above  a  minimum  reliability  level.  Local 
trajectory  planning  algorithms  are  proposed  for  simple  envir- 
onments and  leveraged  to  obtain  global  planning  algorithms  to 
handle  complex  surroundings.  The  resulting  integrated  con- 
trollers are  robust  in  that  end-to-end  communication  survives 
with  high  probability  even  if  individual  point-to-point  links  are 
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likely  to  fail  with  significant  probability.  Experiments  demon- 
strate that  the  global  planning  algorithm  succeeds  in  navigating 
a  complex  environment  while  ensuring  that  end-to-end  com- 
munication rates  meet  or  exceed  prescribed  values  within  a 
target  failure  tolerance. 

KEYWORDS  I  Mobile  ad  hoc  networks;  motion  planning;  multi- 
robot  systems 

I.  INTRODUCTION 

The  confluence  of  advances  in  wireless  communication, 
sensing,  computing,  and  control  has  led  to  the  emergence 
of  cyber-physical  systems.  Cyber-physical  systems  go  be- 
yond the  mere  addition  of  cyber  functionalities  to  physical 
systems  being  typified  by  a  tight  blending  of  cyber  and 
physical  components  and  functions.  A  canonical  example 
of  this  class  of  systems  is  the  autonomous  robot  team 
where  a  group  of  mobile  robots  cooperate  to  accomplish 
tasks  assigned  by  human  operators.  Consider,  for  example, 
a  search  and  rescue  mission  in  a  hazardous  environment 
where  a  team  of  robots  is  deployed  to  scout  points  of  in- 
terest. Designated  lead  members  of  the  team  move  to 
specified  locations  while  the  remaining  robots  provide 
mission  support  by  configuring  a  multihop  wireless  net- 
work that  permits  relaying  of  information  back  to  the 
operators.  Availability  of  wireless  communications  is 
critical  for  task  accomplishment  because  communications 
are  required  to  exchange  information  between  robots  as 
well  as  to  relay  information  to  and  from  the  human 
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Fig.  1.  System  architecture.  Task  specification  tiere  represents  a  generic  spatial  application  defined  by  a  convex  task  potential  function  ^(x) 
while  providing  a  stream  of  data  to  the  human  operator.  Individual  robot  components  consist  of  the  low-level  robot  control,  estimation, 
and  communication.  We  additionally  assume  that  a  subsystem  is  available  to  build  an  online  model  of  radio  communication  in  the  environment. 
The  focus  of  this  work  is  on  developing  concurrent  methods  for  routing  and  mobility  control. 


operators.  It  then  follows  that  deployment  of  the  team 
requires  integration  of  (cyber)  v^ireless  communications 
v^ith  (physical)  mobility  control.  The  cyber-physical 
perspective  posits  that  this  integration  is  not  just  a  matter 
of  adding  v^ireless  transceivers  to  the  robots  but  requires  a 
tight  conjoining  of  communications  and  mobility  control. 

Tight  integration  of  communications  and  mobility 
arises  in  autonomous  robot  teams  because  availability  of 
v^ireless  infrastructure  is  unlikely  in  the  harsh  environ- 
ments in  v^hich  they  are  to  be  deployed.  In  the  absence  of 
infrastructure,  robots  have  to  self-organize  into  a  v^ireless 
netv^ork  capable  of  supporting  the  necessary  information 
exchanges.  In  this  context,  the  spatial  placement  of  the 
robots  affects  the  feasibility  of  successful  communication 
because  some  physical  configurations  cannot  support  re- 
quired transmission  rates  betv^een,  say,  human  operators 
and  some  lead  members.  We  can  therefore  think  of  com- 
munications as  determining  a  set  of  virtual  obstacles  that 
restrict  the  movement  of  the  robots.  Hov^ever,  this  is  not  a 
hard  limit  as  it  is  possible  to  reconfigure  communication 
variables  to  alter  the  set  of  physical  configurations  for 
v^hich  communications  are  feasible.  This  yields  a  control 
loop  that  begins  v^ith  communication  variables  limiting 
mobility  and  finishes  v^ith  mobility  determining  commu- 
nication variables.  In  the  end,  successful  deployment  of  an 
autonomous  team  of  robots  requires  joint  cyber-physical 
controllers  that  determine  (physical)  trajectories  for  the 
robots  v^hile  ensuring  (cyber)  availability  of  communica- 
tion resources  [1]. 

An  architecture  diagram  for  the  cyber-physical  control 
of  autonomous  teams  of  robots  is  shov^n  in  Fig.  1.  As  v^ith 
any  mobility  control  system,  there  is  a  block  performing 


task  specification,  a  second  block  executing  the  control 
lav^,  and  a  third  block  conducting  actuation  and  state  esti- 
mation. The  task  specification  block  interfaces  v^ith  the 
human  operators  and  integrates  robot  observations  and 
requirements  to  determine  specifications  that  it  passes  on 
to  the  control  block.  These  specifications  come  in  the  form 
of  a  potential  function  ^(x)  and  basal  end-to-end  commu- 
nication rate  requirements  af^nin — subindex  i  here  de- 
notes a  source  and  the  superindex  k  a  destination;  see 
Section  II.  The  goal  of  the  autonomous  team  is  to  minimize 
the  potential  ^(x)  v^hile  ensuring  that  end-to-end  com- 
munication rates  exceed  the  minimum  required  level  af^nin- 

The  control  block  is  the  core  of  the  cyber-physical 
system  and  the  main  focus  of  the  v^ork  described  in  this 
paper.  The  purpose  of  this  block  is  to  determine  physical 
control  inputs  x(t)  that  dictate  the  movement  of  the  robots 
and  netv^ork  variables  Oi{t)  that  determine  the  communi- 
cation of  information  through  members  of  the  team.  The 
control  inputs  x(t)  are  conducive  to  task  completion,  i.e., 
minimization  of  the  task  potential,  v^hile  ensuring  reliable 
netv^ork  operation,  i.e.,  communication  at  or  exceeding 
rates  af^^in  when  combined  v^ith  netv^ork  variables  a.(t). 
Information  about  achievable  rates  betv^een  different  loca- 
tions is  necessary  to  determine  mobility  and  communi- 
cation variables.  This  information  is  provided  by  a  radio 
communication  modeling  block.  Due  to  the  difficulty  of 
predicting  these  rates  accurately,  the  output  of  this  block  is 
in  the  form  of  a  statistical  model  of  achievable  rates.  As  v^e 
discuss  in  Section  III,  models  that  provide  the  mean  and 
variance  of  achievable  rates  are  reasonable  in  practice. 

The  execution  block  consists  of  individual  robots 
implementing  the  control  lav^  x(t)  and  routing  packets 
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according  to  variables  a.(t).  Robots  also  take  observations 
yi{t),  e.g.,  a  video  feed,  that  they  relay  to  task  planning  and 
perform  position  estimation  Xi{t)  that  they  feedback  to  the 
control  block.  Using  available  technologies  for  mapping, 
control,  and  state  estimation,  each  robot  can  estimate  its 
position  Xi{t)  and  control  its  velocity  Xi{t)  with  respect  to  a 
common  known  map  of  the  environment.  Since  we  do  not 
provide  details  about  mapping  or  state  estimation  in  this 
paper,  we  refer  the  interested  reader  to,  e.g.,  [2]. 

A.  Communication-Aware  Mobility  Control 

In  the  design  of  concurrent  communications  and  mobi- 
lity control  algorithms  for  autonomous  robot  teams  there 
are  three  issues  that  arise  naturally:  1)  how  to  translate 
robot  positions  into  information  about  the  point-to-point 
rates  at  which  pairs  of  robots  can  communicate  with  each 
other;  2)  how  will  the  cyber-physical  control  loop  in  Fig.  1 
utilize  point-to-point  connectivity  information  in  order  to 
ensure  end-to-end  network  integrity;  and  3)  how  to  deal 
with  uncertainty  in  the  information  about  point-to-point 
channels  between  individual  pairs  of  robots. 

Issues  1)  and  2)  are  related  to  each  other.  A  simple 
answer  to  issue  1)  is  to  identify  the  ability  to  communicate 
with  spatial  proximity,  possibly  mixed  with  line-of-sight 
information  to  account  for  physical  obstacles  to  radio  pro- 
pagation. Using  this  model,  point-to-point  links  between 
robots  are  feasible  if  they  are  sufficiently  close  and  within 
sight  of  each  other  and  infeasible  otherwise.  Matched  to 
this  simple  model  of  point-to-point  connectivity,  the  abi- 
lity to  establish  end-to-end  connections  is  identified  with 
the  connectedness  of  the  resulting  graph.  The  problem  of 
ensuring  end-to-end  network  integrity  then  becomes  one 
of  letting  robots  move  to  accomplish  their  task  while 
guaranteeing  that  the  graph  of  point-to-point  connections 
consists  of  a  single  connected  component  [3]-[10]. 

The  advantage  of  this  approach  is  that  network-wide 
indicators  of  connectedness  such  as  the  second  largest 
eigenvalue  of  the  graph  Laplacian  or  the  fe-connectivity  of 
the  network's  graph  are  computationally  tractable.  It  is 
therefore  not  difficult  to  insert  connectivity  constraints 
into  the  task  planner  and,  if  there  is  leeway  in  the  selection 
of  different  spatial  configurations,  to  find  configurations 
that  optimize  connectivity  in  terms  of  the  metric  under 
consideration.  Do  notice  that  path  planning  is  computa- 
tionally challenging  even  in  the  absence  of  communication 
constraints.  The  tractability  of  graph  connectedness  indi- 
cators simply  means  that  the  computational  complexity  of 
the  cyber-physical  control  loop  is  comparable  to  that  of  a 
communication-unaware  controller. 

Aside  from  the  value  of  simplicity,  associating  small 
distance  and  line-of-sight  with  connectivity  is  not  entirely 
justified.  Early  solutions  to  wireless  networking  for  sys- 
tems of  dynamic  nodes  were  indeed  based  on  the  concept 
that  reliable  point-to-point  links  could  be  detected  and 
established  [11],  [12].  However,  it  was  quickly  noticed  that 
binary  channel  models  were  not  accurate  representations 


of  wireless  links  [13].  As  a  consequence,  attention  was 
devoted  to  methods  that  explicitly  consider  the  reliability 
of  point-to-point  links  and  choose  routes  to  optimize  end- 
to-end  reliability  metrics  such  as  probability  of  delivery 
[14]-[16].  These  more  nuanced  models  of  wireless  connec- 
tivity can  be  incorporated  without  much  difficulty  into 
mobility  control  algorithms  that  use  indicators  of  graph 
connectedness  [17],  [18].  If  links  are  characterized  by 
reliability  indicators,  the  binary  connectivity  graph  can  be 
replaced  by  a  weighted  graph.  Information  about  the 
number  of  connected  components  can  be  similarly  ex- 
tracted from,  for  example,  the  second  largest  eigenvalue  of 
the  weighted  graph's  Laplacian. 

Either  based  on  binary  or  weighted  graphs,  connected- 
ness is  an  indirect  indicator  of  the  ability  to  establish  end- 
to-end  communications.  Having  a  single  connected 
component  just  implies  that  multihop  paths  from  sources 
to  destinations  exist  but  does  not  determine  whether  the 
network  formed  by  the  robots  is  able  to  support  desired 
communication  rates.  It  is  possible  that  some  links  in  the 
connectivity  graph  can  support  rates  that  are  too  low,  or 
that  the  resulting  configuration  contains  bottlenecks  that 
limit  information  flow.  In  the  cyber-physical  control  loop 
depicted  in  Fig.  1,  graph  connectedness  does  not  solve  for 
the  network  variables  a.(t).  Instead,  it  restricts  movement 
so  that  a  necessary  but  not  sufficient  condition  for  the 
existence  of  feasible  network  variables  a.(t)  is  not  violated. 
In  fact,  one  could  argue  that  these  controllers  are  not 
cyber-physical  because  the  integration  of  wireless  commu- 
nications with  physical  mobility  is  minimal.  A  true  cyber- 
physical  design  should  consider  end-to-end  rates  and 
conjoin  the  design  of  mobility  and  network  control  as  done 
in  the  systems  described  in  the  following  section. 

B.  Robust  Cyber-Physical  Control 

An  honest  metric  of  network  integrity  has  to  rely  on  the 
achievability  of  target  end-to-end  communication  rates. 
Communication  rates,  however,  do  not  depend  solely  on 
the  spatial  configuration  but  also  on  the  manner  in  which 
packets  are  routed  through  the  network.  This  leads  to  the 
problem  of  joint  cyber-physical  control  of  trajectories  and 
routing  variables  [1],  [19].  For  given  spatial  configuration  x, 
pairs  of  achievable  point-to-point  communication  rates 
Rij(x)  are  assumed  available  at  the  radio  modeling  block. 
The  cyber  part  of  the  controller  queries  the  modeling  block 
and  attempts  to  determine  routing  variables  a.(x)  that 
support  desired  communication  rates  af^^in-  ^^^^  attempt 
is  unsuccessful,  spatial  configuration  x  is  marked  as  un- 
allowable. If  the  attempt  succeeds,  joint  configuration 
(x,  a.(x))  is  deemed  feasible  and,  in  particular,  spatial 
configuration  x  is  deemed  allowable.  The  physical  part  of 
the  controller  utilizes  the  information  on  allowable  con- 
figurations X  to  plan  physical  routes  that  achieve 
minimization  of  the  task  potential  ^(x)  while  ensuring 
that  end-to-end  rates  exceed  basal  requirements  at  all 
points  in  time.  See  Section  II. 
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Ensuring  continuous  reliable  communication  is  chal- 
lenging, however,  because  significant  uncertainty  in 
achievable  rates  is  inherent  to  the  deployment  of  an  auto- 
nomous robot  team.  Due  to  shadowing  and  small  scale 
fading,  even  small  variations  in  robots'  positions  lead  to 
significant  changes  in  channel  strength  [13],  [20],  [21] 
which  translate  into  commensurable  changes  in  achievable 
communication  rates  between  pairs  of  robots.  Precise 
channel  state  information  can  be  acquired  through  mea- 
surements over  time,  but  mobility  planning  algorithms 
necessitate  access  to  channel  quality  indicators  at  future 
positions  to  which  the  robots  are  yet  to  be  deployed.  Thus, 
from  the  perspective  of  the  system  in  Fig.  1,  rates  i^ij(x)  are 
random.  The  outputs  of  the  radio  modeling  block  are  rate 
estimates  -Rij(x)  whose  variances  -Rij(x)  are  typically  large. 
See  Section  III. 

Since  achievability  of  end-to-end  rates  cannot  be  gua- 
ranteed because  of  rate  uncertainties,  it  is  natural  to  rede- 
fine network  survivability  in  terms  of  the  probability  of 
end-to-end  rates  exceeding  their  basal  rates  afj^in-  Packet 
routing  variables  cx(x)  are  then  determined  in  order  to 
maximize  this  probability  [1].  The  resulting  routes  are 
robust  because  they  are  chosen  to  minimize  the  probability 
of  failure.  At  this  point  it  has  to  be  noted  that  the  goal  of 
the  self-organized  network  is  to  maintain  reliable  end-to- 
end  communication,  not  point-to-point  communication. 
Consequently,  we  can  think  of  robust  routes  as  exploiting 
spatial  redundancy  to  minimize  the  effect  of  point-to-point 
uncertainty  in  end-to-end  communication  rates.  Indeed, 
by  splitting  traffic  flows  between  various  neighboring  ro- 
bots, we  can  ensure  that  while  failure  of  a  particular  link 
may  reduce  end-to-end  communication  rates,  it  does  not 
interrupt  them  completely.  See  Section  IV. 

The  integration  of  robust  routing  with  the  physical  con- 
trol loop  is  discussed  in  Section  V.  We  develop  local  plan- 
ning algorithms  for  simple  environments  (Section  V-A)  that 
we  test  in  simulations  for  teams  composed  of  three  and  four 
robots  (Section  V-B).  These  simulations  demonstrate  that 
the  performance  of  local  controllers  is  hindered  by  local 
minima.  To  overcome  this  problem  as  well  as  to  handle 
complex  environments,  we  leverage  the  local  controllers  to 
design  global  planning  algorithms  (Section  V-C).  Global 
planners  are  tested  with  experimental  results  in  an  indoor 
office  space  at  the  University  of  Pennsylvania  (Section  V-D). 
The  experiments  demonstrate  that  the  global  planning 
algorithm  succeeds  in  navigating  a  complex  environment 
while  ensuring  that  end-to-end  communication  rates  meet 
or  exceed  prescribed  values  within  a  target  failure  tolerance. 
We  close  the  paper  with  concluding  remarks  (Section  VI). 

II.  COMMUNICATIONS  AND  MOBILITY 

Consider  a  team  of  N  robots  and  denote  their  positions  as 
Xi,  for  i  =  1, . . . ,  N.  The  robots  are  kinematic  and  fully 
controllable  which  allows  us  to  consider  simple  mobility 
models  of  the  form  Xi{t)  =  Ui(t),  where  Ui(t)  is  the  control 


input  to  robot  i.  A  human  operator  is  located  at  the  fixed 
operation  center  that  we  index  as  i  =  0  at  position  xq. 
Further  define  vectors  x  :=  (xq,  . .  .  G  R^*^^^^^  and 
X  :=  (xo, . . .  ,Xn)  G  R^*^^^^^  to  group  all  positions  and 
velocities,  respectively.  The  task  assigned  to  the  team  is 
specified  through  a  generic  scalar  convex  task  potential 
function  ^  :  R^^^+^^  R.  If  the  potential  minimum  ^min 
is  attained  at  x*,  i.e.,  if  ^(x*)  =  ^min.  the  configuration 
X*  satisfies  task  completion.  For  example,  if  a  designated 
leader  agent  i  must  visit  a  target  location  x^^goai  ^  R^,  we 
can  define  ^(x)  =  ||x^  —  x^^goai||^-  The  minimum  ^min  = 
0  is  attained  by  any  configuration  x*  =  (xq,  .  . . , x^, . . . , 
Xn)  for  which  Xi  =  x^^goah  or  equivalently  by  any  member 
of  the  set  x*  G  {x  =  (xq,  . . . , x^,  . . . , x^)  :  x^  =  x^,goai}. 
Irrespective  of  the  particular  form  of  ^(x),  the  control 
problem  is  to  find  velocities  x(t)  such  that  at  some  time  tf 
the  team  configuration  ^{tf)  =  x(0)  +  x(t)dt  satisfies 
task  completion  in  that  we  have  ^(x(t_^))  =  ^min- 
Mathematically,  we  can  write  this  mobility  control  formu- 
lation as 

min  ^(xftf)) 

t 

subject  to    x(t)  =  x(0)  +  J  x(u)  du.  (1) 

0 

As  robots  move  to  accomplish  their  task,  they  maintain 
end-to-end  data  communication  flows  between  members  of 
the  team  and/or  members  of  the  team  and  the  operation 
center.  Information  flows  are  indexed  as  fe  =  1, . .  . ,  K. 
Flows  may  have  multiple  sources  and  multiple  destina- 
tions. The  set  of  destinations  of  the  feth  information  flow  is 
denoted  as  dest(fe).  For  agent  i  and  flow  k,  the  variable 
min  represents  the  required  communication  rate  between 
agent  i  and  any  of  the  agents  in  the  set  of  destinations 
dest(fe).  For  example,  if  the  only  communications  of  inter- 
est are  from  the  lead  robot  i  to  the  operation  center,  there 
are  only  K  =  1  flows.  Since  the  flow  fe  =  1  is  intended  to 
the  operating  center,  dest(l)  =  0  and  ^-^  denotes  the 
minimum  level  of  service  for  the  communication  from  the 
leader  to  the  operating  center.  All  other  variables 
«tmin  =  0  are  null. 

We  model  point-to-point  connectivity  through  a  rate 
function  Rij(x)  =  Rij(xi,Xj)  that  determines  the  amount  of 
information  that  agent  i  at  position  Xi  can  send  to  agent  j  at 
position  Xj.  Since  direct  communication  between  the 
source  and  the  destination  of  an  information  flow  is  not 
always  possible,  terminals  self-organize  into  a  multihop 
network  to  relay  packets  for  each  other.  Packet  relaying  is 
determined  by  routing  variables  a^j  which  describe  the 
fraction  of  time  node  i  spends  transmitting  data  for  flow  k 
to  node  j;  see  Fig.  2.  Thus,  the  product  a^jRij{x.)  determines 
the  rate  of  point-to-point  information  transmission  from 
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task  specifications  in  (1)  and  (3).  Reducing  ^(x)  as  per  (1) 
and  ensuring  network  integrity  as  per  (3)  may  be 
conflicting  requirements.  We  therefore  replace  (1)  by  a 
concurrent  search  of  trajectories  x(t)  and  routes  Oi{t)  so 
that  the  task  potential  is  minimized  without  ever  breaking 
communication  connectivity.  Mathematically,  we  write 
this  objective  as  the  optimization  problem 


Fig.  2.  Communication  networii,  Ttie  nodes  are  deployed  to  support 
end-to-end  rates  from  node  i  to  destination  (or  flow)  k.  Routing 
variables     determine  the  fraction  of  time  node  i  sends  packets  to 
node  j  for  flow  k.  Rjj  is  defined  as  the  supported  rate  of  the  wireless 
channel  from  node  i  to  node  j. 


i  to  ].  If  we  consider  the  transmission  to  all  neighboring 
terminals  for  which  Rij(x)  >  0,  the  total  rate  at  which  pa- 
ckets leave  agent  i  is  Ylf=o  ^^j^iji^)-  Likewise,  the  total  rate 
at  which  i  receives  packets  from  other  terminals  is 
.j=om^st{k)^^M^)'  The  information  rate  af(a,x) 
available  for  flow  k  at  source  i  is  the  difference  between 


avail  al 

outgoing  and  incoming  rates 


af(a,x)=  ^4Ri,.(x)  -  c^M^) 

i=0  ]=O,]0dest(fe) 


(2) 


Outgoing  packets 


Incoming  packets 


where  we  define  the  vector  a.  grouping  all  routing 
variables  a^j.  Notice  that  the  variables  a^j  represent  time 
slot  shares  and  must  therefore  satisfy  0  <  a^j  <  1  for  all  i,  j, 
and  k.  It  must  also  be  that  a^-  <  1  for  all  i  to  ensure 
that  the  sum  of  all  time  shares  at  terminal  i  does  not  exceed 
1.  It  is  possible  to,  alternatively,  require  j  o^j  ^  1  if  we 
wish  for  only  one  link  to  be  active  at  any  time  across  the 
entire  network. 

Routing  variables  a.  and  configuration-dependent  rates 
Rij(x)  determine  the  set  a^{oi^x.)  of  end-to-end  commu- 
nication rates  from  each  node  i  and  flow  k  as  per  (2).  The 
task  specification  requires  that  end-to-end  rates  exceed 
the  minimum  threshold  afi^in-  Therefore,  integrity  of  the 
communication  network  necessitates  that 


for  all  i,  k. 


(3) 


Notice  that  a^{oi^  x)  is  a  function  of  positions  x  and  rout- 
ing variables  a..  To  control  end-to-end  connectivity,  i.e.,  to 
satisfy  (3),  we  can  resort  to  control  of  positions  x,  to 
control  of  routes  a.,  or  both. 

Since  communication  is  necessary  for  task  completion, 
the  mobility  control  problem  as  summarized  in  (1)  is 
redefined.  The  new  goal  is  to  find  algorithms  and  control 
policies  that  govern  robot  motions  in  order  to  satisfy  the 


min  ^(x(tf)) 


subject  to        af(a(t),x(t))  >af  . 


r 

x(t)  =  x(0)  +  j  x(u)  du 


(4) 


where  rates  aj^(a.(t), x(t))  are  given  by  the  expression 
in  (2)  with  a.  =  a(t)  and  x  =  x(t). 

A  drawback  of  the  formulation  in  (4)  is  the  difficulty  of 
ensuring  that  the  constraints  in  (3)  are  satisfied.  As  seen 
in  (2),  rates  (a,  x)  depend  on  the  link  reliabilities  Rij(x), 
which  are  difficult  to  estimate.  Propagation  knowledge  is 
used  by  the  radio  communication  modeling  block  in  Fig.  1 
to  provide  rough  rate  estimates  Rij(x).  Agents  can  further 
measure  received  signal  strength  indicators  (RSSIs)  to 
refine  the  rate  estimates  for  the  current  configuration  x(t); 
see,  e.g.,  [22].  The  challenge  here  is  that  Rij(x)  estimates 
are  needed  not  only  for  configuration  x(t),  but  for  nearby 
configurations  to  which  the  robots  may  move.  The  high 
variability  of  wireless  channels  makes  Rij(x(t))  a  poor 
predictor  of  Rij(x)  even  if  x  is  close  to  x(t).  In  general, 
channel  estimates  Rij(x)  at  future  positions  x  follow  from 
radio  propagation  modeling  and  corrections  from  RSSI 
observations. 

In  formal  terms,  uncertainty  of  channel  estimates 
means  that  the  variances  Rij  (x)  of  channel  estimates  Rij  (x) 
are  typically  large  for  possible  future  positions  x. 
Variances  Rij(x)  can  be  provided  by  the  radio  modeling 
block  and  are  therefore  assumed  to  be  available  at  the 
controller  along  with  the  mean  estimates  Rij(x).  We  seek 
to  redefine  (4)  in  a  manner  that  takes  into  account  this 
probabilistic  formulation  of  channel  rates.  The  important 
observation  here  is  that  if  point-to-point  link  rates  become 
random,  so  do  the  rates  a^{oi^x.)  of  end-to-end  commu- 
nication flows  [cf.  (2)].  Consequently,  it  is  not  possible  to 
guarantee  satisfaction  of  the  constraints  in  (3).  Rather,  we 
introduce  a  reliability  tolerance  e  and  require  that  for  all  i 
and  k 


>  e. 


(5) 


That  is,  we  require  that  the  end-to-end  link  between  all 
sources  i  and  the  destinations  of  all  corresponding  flows  k 
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exceed  their  minimum  required  level  of  service  with  pro- 
bability larger  than  e.  As  in  the  case  of  the  rate  require- 
ment in  (3),  we  can  satisfy  (5)  by  controlling  a.  and  x 
separately  or  jointly. 

In  order  to  robustly  satisfy  the  networking  constraints 
in  (5),  the  concurrent  routing  and  mobility  problem  (4)  is 
replaced  by 


min  ^(x(tf)) 

a(t),x(t),te[0,t;]      ^  '^'^ 


subject  to 


>  e 


af(a,x)  >al^.^ 
t 

x(t)  =  x(0)  +  J  x(u)  du. 


(6) 


The  focus  of  this  paper  is  the  solution  of  (6). 

The  problem  formulation  in  (6)  inherits  some  standard 
complications  from  the  control  formulation  in  (1).  The 
problem  is  infinite  dimensional  and  due  to,  e.g.,  the 
presence  of  obstacles,  not  convex.  The  concurrent  search 
in  (6)  is  further  complicated  by  the  entanglement  of  the 
routing  and  mobility  problems.  We  deal  with  this  en- 
tanglement by  fixing  x  and  selecting  a.  in  a  manner  that 
optimizes  the  reliability  P[af  (a,  x)  >  a^^^^]  (Section  IV). 
We  finish  with  local  and  global  searches  on  positions  x 
to  minimize  ^(x)  while  keeping  reliabilities  above  the 
e  threshold  (Section  V). 


signal  strength  PR(xi,  Xj)  that  we  cascade  into  models  of  the 
packet  error  rate  and  the  supported  communication  rate 

Received  signal  power  P^^Xi^Xj)  is  determined  by  three 
phenomena:  path  loss  due  to  the  distance  from  the  source, 
shadowing  due  to  obstacles  in  the  propagation  path,  and 
multipath  fading  that  arises  as  a  result  of  reflections  and 
refractions.  Of  these  three  phenomena,  path  loss  and  sha- 
dowing can  be  incorporated  into  a  predictive  model  with 
relative  ease.  Fading,  however,  is  difficult  to  predetermine. 
Multipath  fading  arises  due  to  a  spatial  wave  interference 
pattern  generated  by  reflections  and  refractions  of  the 
electromagnetic  wave  which,  as  a  consequence,  inherits  a 
space  constant  in  the  order  of  the  wave's  wavelength. 
Thus,  in  order  to  use  ray  tracing  models  to  predict  the 
interference  pattern  we  need  to  locate  all  potential 
reflectors  and  refractors  with  precision  smaller  than  this 
wavelength.  While  plausible  in  principle,  this  is  intractable 
in  practice  because  wavelengths  at  common  operating  fre- 
quencies are  too  small — for  example,  with  radios  operating 
at  2.4  GHz.,  the  wavelength  is  about  12  cm.  Given  this 
difficulty  we  adopt  the  following  model  for  the  received 
power  PR,dBm(^i,  ^])  =  10  log(PR(xi,  ^j))  measured  in  dBm: 


PR,dBm{Xi,Xj) 

=  Lo  -  lOn  •  log(||xi  -XjW 

Path  loss 


(7) 


Shadowing  F^^i^g 


III.  POINT-TO-POINT  RATE  MODELING 

With  robots  i  and  j  located  at  positions  Xi  and  Xj,  we  seek  to 
develop  a  probabilistic  model  for  the  supported  communi- 
cation rate  Pij(x)  =  Rij{xi,Xj)  between  them.  More  specif- 
ically, we  wish  to  model  its  expected  value  Pij(x)  = 
Rij{xi^Xj)  and  variance  Pij(x)  =  Rij(xi,Xj).  Our  intent  is  to 
use  simple  radios  that  do  not  perform  rate  or  power 
adaptation — ^we  use  2.4-GHz  Zigbee  radios  in  our  experi- 
ments; see  Section  V-D.  In  that  case,  the  communication 
rate  Pij(xi,Xj)  is  a  function  of  the  packet  error  rate  of  the 
channel,  which  in  turn  is  a  function  of  the  signal-to-noise 
ratio  [23].  We  therefore  focus  on  models  of  the  received 


where  the  term  ^  is  a  zero-mean  Gaussian  random  varia- 
ble with  variance  cr^  modeling  fading  effects.  The  term  Lq 
is  the  measured  power  at  a  reference  distance  do  from  the 
source,  n  is  a  path  loss  exponent,  and  W(xi,Xj)  is  a  non- 
smooth  function  to  model  shadowing  as  a  function  of  the 
number  of  obstacles  between  source  and  destination. 
Experimental  data  collected  in  an  indoor  office  environ- 
ment at  the  University  of  Pennsylvania  along  with  the 
function  in  (7)  having  parameters  fit  to  the  experimental 
data  are  depicted  in  Fig.  3.  The  parameters  in  Fig.  3  are 
Lq  =  —51  dBm  for  do  =  1  m,  n  =  2.1,  W(xi,Xj)  =  0  for 
line-of-sight  links  and  W(xi,Xj)  =  7.6  dB  for  non-line-of- 
sight  links,  and  cr^  =  32  dB^.  Notice  that  fading  can  cause 


RSS  (dBm)  RSS  (dBm) 


Distance  (m) 


(a) 


Fig.  3.  Experimental  characterization  of  received  power  model  (7)  based  on  1000  samples  when  we  classify  wireless  channels  based  on 
line-of-sight.  (a)  Line-of-sight,  (b)  Non-line-of-sight. 
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variations  in  received  power  in  the  order  of  ±10  dB.  This 
is  verified  by  the  standard  deviation  of  T ,  which  is  gj:  ^ 
5.7  dB.  Recall  that  a  10-dB  difference  corresponds  to  a 
change  of  an  order  of  magnitude  for  the  power  measured 
in  linear  units. 

Translation  of  received  signal  strength  PR(xi,Xj)  into 
packet  error  rate  pe(PR(xi, Xj))  depends  on  the  type  of 
modulation  and  the  choice  of  error  correcting  codes. 
Regardless  of  specifics,  a  generally  good  approximation  to 
the  packet  error  rate  is 


p,(PR(xi,Xj))  =erfcL/k^^  (8) 


where  P^^  is  the  noise  power,  erfc(x)  is  the  complemen- 
tary error  function,  and  fe  is  a  constant  that  depends  on 
modulation  and  coding  [23].  The  translation  into  commu- 
nication rates  Rij(xi,Xj)  is  now  straightforward  as  it  simply 
requires  multiplying  the  information  rate  of  transmitted 
packets  Rq  by  the  probability  of  successful  decoding,  i.e., 

Rij{Xi,Xj)  =  Ro[l  -  Pe{PR{Xi,Xj))]. 

The  randomness  of  the  fading  term  J-"  in  (7)  propagates 
into  randomness  in  the  error  probabilities  pe(P(xi,Xj))  and 
subsequently  into  randomness  in  the  rates  Rij(xi,Xj).  To 
compute  the  mean  and  variance  of  Rij(xi,Xj)  we  need  to 
recall  that  ^  is  a  Gaussian  random  variable.  We  can  then 
invoke  the  delta  method  [24]  to  approximate  the  mean  and 
variance  of  Rij(xi,Xj)  in  terms  of  the  mean  and  variance  of 
PR(xi,Xj)  and  the  derivatives  of  the  function  pe{PR{xi^Xj)) 
in  (8).  The  complete  mapping  from  distance  to  link  rates  is 
depicted  in  Fig.  4.  Observe  that  except  for  very  short 
distances  smaller  than  3  m  there  is  significant  variability  in 
achieved  link  rates.  See  [1,  ch.  4]  for  details. 

While  the  model  we  present  in  this  section  describes 
the  statistics  of  the  supported  rate  Rij(xi,Xj)  in  terms  of 
distances  ||xi  —  Xj||,  we  emphasize  that  this  is  not  a  limi- 
tation of  the  proposed  cyber-physical  controller.  In  fact. 


Normalized  Rate 


Distance  (m) 


Fig.  4.  Mapping  from  distances  -  xy  ||  between  a  transmitter  at 
location  x-,  and  a  receiver  at  position  xj  to  normalized  link  rates 

Rjj{x„Xj)/Ro. 


the  controller  is  agnostic  to  the  particulars  of  the  radio 
propagation  model  and  relies  only  on  the  ability  to  query 
for  means  and  variances.  It  is  reasonable  to  expect  that 
techniques  for  spatial  mapping  of  point-to-point  communi- 
cation rates  will  outperform  distance-based  models  when 
considering  unknown  or  highly  dynamic  environments. 
However,  as  demonstrated  by  the  experiments  in 
Section  V-D,  the  robust  satisfaction  of  communication 
constraints  affords  the  possibility  to  rely  on  coarse  models 
of  point-to-point  communication  rates  while  still  ensuring 
maintenance  of  quality-of-service  requirements. 

IV.  ROBUST  ROUTING 

The  control  loop  in  Fig.  1  embeds  a  cyber  optimization 
component  and  a  physical  reconfiguration  module.  The 
cyber  optimization  component  optimizes  communication 
routes  for  given  positions,  while  the  physical  component 
explores  spatial  reconfigurations  that  maintain  network 
integrity  and  are  conducive  to  task  achievement.  The  phy- 
sical module  relies  on  the  cyber  module  for  the  determi- 
nation of  configurations  that  are  feasible  from  the 
perspective  of  maintaining  target  communication  rates 
with  some  desired  reliability. 

The  major  difficulty  in  designing  the  cyber  component 
of  this  control  loop  is  the  uncertainty  in  achievable  trans- 
mission rates  between  nearby  agents.  As  we  discussed  in 
Section  III,  simple  experiments  suffice  to  demonstrate  that 
achievable  rates  in  wireless  links  are  difficult  to  predict. 
Therefore,  assuming  that  the  actual  channel  rates  Rij(x) 
coincide  with  their  estimates  Rij(x)  may  result  in  a  drastic 
difference  between  predicted  and  actual  end-to-end  rates. 
This  mismatch  subsequently  leads  to  situations  in  which 
the  communication  optimization  module  asserts  network 
integrity  when  actual  rates  do  not  suffice  to  support  the 
task  assigned  by  operators. 

A  simple  way  to  account  for  the  uncertainty  in  Rij  (x)  is 
to  discount  Rij(x)  in  order  to  reduce  the  likelihood  of 
having  actual  rates  smaller  than  the  assumed  value.  This  is 
definitely  possible  but  would  result  in  underutilization  of 
communication  resources.  A  better  way  to  account  for  the 
uncertainty  in  channel  rates  is  to  recall  that  end-to-end 
rather  than  point-to-point  failures  are  relevant.  With  re- 
ference to  Fig.  2,  say  that  robot  i  decides  to  convey  infor- 
mation to  fe  by  forwarding  it  through  L  If  the  rate  Rii(x) 
happens  to  be  substantially  smaller  than  estimated,  com- 
munication is  interrupted.  If,  on  the  contrary,  robot  i  de- 
cides to  send  part  of  the  information  through  I  and  some 
other  part  through  j,  the  communication  between  i  and  In 
survives  as  long  as  the  rate  Rij(x)  is  larger  than  estimated. 
This  example  illustrates  that  it  is  possible  to  exploit  spatial 
redundancy  through  traffic  splitting  in  order  to  devise  ro- 
bust routes  that  guarantee  small  changes  in  end-to-end  rates 
despite  large  variability  in  point-to-point  rates  Rij(x)  [25]. 

To  develop  robust  routing  algorithms,  start  by  noticing 
that  computing  the  probability  in  (5),  which  is  part  of  the 
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problem  formulation  in  (6),  necessitates  modeling  the 
probability  distribution  of  a^{oi^x.).  This  is  difficult  in 
general.  However,  if  we  explicitly  consider  the  stochastic 
model  of  point-to-point  links  via  their  means  and  va- 
riances, we  can  compute  the  mean  and  variance  of  end-to- 
end  rates  a^{oi,x.)  as 


df(a,x)  :=E[af(a,x)] 

=  X^a5.R,-(x)-    Yl   ^M^)  (9) 

]■  ]0dest(fe) 

aj^(a,x)  :=  var[af(a,x)] 

=  E(4)%w+  E  (10) 

]■  ;0dest(fe) 


A  proxy  for  the  probability  in  (5)  is  the  difference  between 
a^{oi^  x)  and  its  mean  a^{oi^  x)  normalized  by  its  standard 

deviation  ^J~^^(oi^x).  Indeed,  a  typical  approximation  for 
the  probability  constraint  in  (5)  is  the  condition 


(cx,  x)  :=   >  ^-\e)  (11) 

5f(a,x) 


for  some  function  ^~^{e).  If  the  probability  distribution  of 
rates  Rij(x)  is  Gaussian,  (11)  is  equivalent  to  (5)  if  ^~^(e)  is 
the  inverse  of  the  normal  distribution's  cumulative 
distribution  function.  For  other  probability  distributions, 
we  can  apply  Chebyshev's  inequality  with  ^~^{e)  =  y^l/e 
to  demonstrate  that  (11)  is  a  sufficient  condition  for 
satisfying  (5)  though  this  will  usually  be  a  conservative 
approximation. 

For  given  positions  x,  the  goal  of  the  cyber  optimiza- 
tion component  is  to  find  routing  variables  a.  that  satisfy 
(11)  which  is  equivalent  to,  approximate  to,  or  a  sufficient 
condition  to  satisfy  (5).  In  either  case,  there  is  some  inde- 
terminacy because  there  is  a  nonunique  set  of  variables  a. 
that  satisfy  the  corresponding  inequality.  This  indetermi- 
nacy provides  a  degree  of  freedom  that  can  be  used  to 
increase  the  reliability  beyond  the  required  level.  For  a 
given  configuration  x,  we  would  like  to  find  routes 
a.  =  a.(x)  that  provide  the  maximum  possible  reliability. 
Doing  so  is  not  conceptually  difficult — it  requires  determi- 
nation of  routes  a.(x)  that  maximize  the  probability  pro- 
xies Pf(a,x)  in  (11)  in  some  sense.  Considering  that  there 
is  a  rate  variable  for  each  source-destination  pair,  there 
are  several  possible  choices  to  define  this  maximization. 
One  possibility  is  to  maximize  the  sum  of  these  probability 
proxies.  Another  possibility  is  to  select  routes  a.(x)  that 
make  the  smallest  of  these  terms  as  large  as  possible. 
Maximizing  this  minimum  implies  that  the  constraints  in 
(11)  are  satisfied  with  significant  slack  and  that  there  is 


significant  liberty  to  change  the  physical  configuration 
without  violating  communication  constraints.  This  free- 
dom of  movement  facilitates  implementation  of  the  physi- 
cal mobility  control  block  as  we  discuss  in  Section  V. 

An  important  observation  here  is  that  the  probability 
proxy  constraints  in  (11)  can  be  written  in  a  manner  that 
defines  a  cone  in  the  joint  space  of  routing  variables  and 
end-to-end  rates  [1,  ch.  5].  A  consequence  of  this  property 
is  that  the  optimization  problems  that  need  to  be  solved  to 
maximize  them  are  second-order  cone  program  (SOCPs). 
SOCPs  are  a  particular  class  of  convex  optimization 
problem  that  can  be  solved  by  efficient  polynomial-time 
algorithms.  For  the  problems  considered  here,  the  compu- 
tational complexity  of  these  algorithms  is  represented  as  a 
polynomial  function  of  the  number  of  agents  N  and  the 
number  of  flows  (destinations)  K,  as  0((JC  •  N^)  '  ).  In 
practical  implementations,  the  term  can  be  reduced  by 
eliminating  links  for  which  rate  estimates  5ij(x)  are  below 
a  certain  threshold. 

Recall  that  the  motivation  for  robust  routing  algorithms 
is  to  reduce  uncertainty  in  end-to-end  communication 
rates.  We  do  so  by  taking  advantage  of  spatial  redundancy, 
for  which  it  is  necessary  to  split  traffic  among  various 
different  routes.  It  is  fitting  that  we  do  expect  to  obtain  this 
type  of  solution  from  the  maximization  of  the  probability 
proxies  P|^(a.,  x)  in  (11).  Indeed,  increases  in  these  terms 
can  be  brought  about  by  either  increasing  the  mean 
d^{oi^x.)  or  decreasing  the  variance  a|^(a.,x).  Since  the 
mean  is  a  linear  function  of  a.,  traffic  splitting  has  a  minor 
effect  on  aj^(a.,x).  However,  traffic  splitting  reduces 
aj^(a.,x)  by  a  factor  proportional  to  the  splitting  because 
the  variance  is  a  quadratic  function  of  a. — recall  that 
c^ij  ^  1-  Thus,  traffic  splitting  tends  to  increase  the  proba- 
bility proxies  because  it  keeps  d^{oi^x.)  more  or  less  con- 
stant and  reduces  a|^(a.,x)  significantly. 

V.  CONTROL 

As  per  (6),  the  objective  of  mobility  control  is  to  decrease 
^(x)  while  satisfying  the  probability  proxy  constraints  in 
(11) — ^which  are  equivalent  or  approximations  to 
P[af(a,x)  >  af^-^]  >  e.  To  check  for  the  feasibility  of  a 
cyher-physical  configuration  (a.,x),  we  define  the  prob- 
ability margin  as  the  minimum  slack  in  probability  proxy 
constraints  across  all  flows  and  sources 


z^(cx.,  x)  :=  min 


af(a,x)  -af^ 


5f(a,xO 


(12) 


Notice  that  a  necessary  and  sufficient  condition  for 
feasibility  of  the  physical  configuration  x  is  to  have 
z/(a.(x),x)  >  0,  with  routing  variables  a.(x)  as  given  by 
the  solution  of  the  SOCP  described  in  Section  IV.  A 
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sufficient  condition  for  feasibility  of  physical  configuration 
is  the  existence  of  a  cyber  configuration  a.  for  which 
iy{oi^  x')  >  0.  In  particular,  for  x'  close  to  x,  we  expect  to 
have  z/(a.(x),  x')  >  0  since  the  channel  statistics  at  x  and 
x'  are  close. 

A.  Gradient  Controller 

In  general,  gradient  controllers  define  velocities  x(t) 
proportional  to  the  negative  gradients  —  V^(x(t))  of  the 
task  potential.  Since  the  problem  in  (6)  is  subject  to  com- 
munication constraints,  a  local  controller  will  be  based  on 
potential  gradients  projected  onto  the  feasible  set 
z/(a.,  x)  >  0.  The  complex  description  of  the  feasible  set, 
however,  precludes  computation  of  projected  gradients. 
Instead,  we  consider  the  probability  margin  i/(a.(x),x) 
and  modify  the  potential  ^(x)  by  adding  the  probability 
margin  constraint  into  the  objective  through  a  barrier 
function 

l^(x)  :=  *(x)  -  log(z.(a(x),  x)).  (13) 

Since  nonnegativity  is  necessary  and  sufficient  for  feasibi- 
lity of  physical  configuration  x,  the  potential  Q{x.)  in  (13) 
is  defined  if  and  only  if  physical  configuration  x  is  feasible. 

The  local  control  law  is  defined  to  implement  gradient 
descent  on  the  modified  potential  introduced  in  (13), 
which,  in  explicit  terms,  is  given  by 

u(0  =  -V^(x(t))+^f[^^^  (14) 
^[a(x(t)),x(t)] 

The  term  V^(x(t))  in  (14)  drives  the  system  to  satisfy  the 
task  potential.  The  term  VxZ^[oL(x(t)), x(t)]/i/[a(x(t)), 
x(t)]  serves  as  a  barrier  that  drives  robots  away  from  con- 
figurations for  which  there  is  a  low  probability  of  exceed- 
ing the  desired  reliability  in  end-to-end  rates. 

B.  Simulation  Results 

We  implement  the  local  controller  with  mobility  con- 
trol inputs  given  by  (14)  and  communication  variables 
obtained  from  Section  IV.  Computing  controls  based  on 
local  optimization  of  the  network-level  end-to-end  rates 
allows  for  a  method  of  realizing  team  deployment  while 
maintaining  the  necessary  level  of  network  connectivity. 
Fig.  5  depicts  an  example  deployment  with  three  robots  for 
a  time-varying  task  potential 

(  X2,goal  =  (4,0),  t<40 
^(x(t))  =  <^  X2,goal  =  (8,0),     t<  60  (15) 
[x2,goal  =  (6,5),  t>60. 


X(m) 


Fig.  5.  Deployment  via  local  control  law  (14)  for  a  system  with  a 
fixed  access  point,  relay  node  Xi  and  lead  node  X2,  which  is 
controlled  by  a  time-varying  task  potential  ^(x(t)). 


Most  importantly,  this  example  demonstrates  conver- 
gence of  the  task  potential  ^(x)  while  maintaining 
P[ai  >  Qi^niin]  >  ^  as  depicted  in  Fig.  6(a)  and  (c).  How- 
ever, it  is  also  interesting  to  observe  that  when  the  task 
potential  is  minimized,  e.g.,  t  <  30  s,  the  local  control  law 
(14)  maximizes  the  probability  of  each  end-to-end  rate 
exceeding  its  minimum  threshold.  When  the  task  potential 
switches  so  that  ^(x)  is  no  longer  minimized,  the  proba- 
bility margin  is  reduced  so  that  the  primary  objective, 
minimization  of  ^(x),  is  prioritized.  Finally,  Fig.  6(b)  de- 
picts the  end-to-end  rate  of  the  node  X2  that  must  remain 
above  a2,min  =  0.1.  Remember  that  -Rij(x)  is  a  stochastic 
rate  that  affects  the  end-to-end  rate.  The  envelope  around 
0.2  in  Fig.  6(b)  depicts  the  effect  that  different  realizations 
of  communication  channels  Rij  (x)  will  have  on  the  end-to- 
end  rate.  Since  the  pursuit  of  minimization  on  ^(x)  is 
constrained  to  have  a  probability  margin  i/(a(x),x)  >  0, 
the  end-to-end  rate  exceeds  its  threshold  in  the  presence  of 
deviations  to  -Rij(x). 

We  perform  a  four  robot  simulation,  depicted  in  Fig.  7, 
to  demonstrate  how  the  complexity  of  the  objective  func- 
tion (13)  increases  with  more  nodes.  Similar  to  the  three 
robot  simulation  above,  the  task  potential  function  is  time 
varying  so  that  ^(x)  =  0  for  t  <  50  s  and  ^(x)  =  ||x3  — 
(9,  0)  II  for  t  >  50  s.  When  t  <  40  and  the  task  potential  is 
already  minimized,  control  of  xi  and  X2  is  based  solely  on 
the  maximization  of  log z/(a.(x),  x).  From  the  symmetric 
initial  configuration,  the  maximization  drives  Xi  and  X2 
towards  a  local  maxima  where  they  would  be  positioned  at 
the  same  point.  In  fact,  if  we  consider  all  configurations 
where  Xi  <  X2  and  both  agents  are  on  the  line  connecting 
the  access  point,  we  find  that  there  are  two  local  maxima  as 
depicted  in  Fig.  8. 

To  further  illustrate  the  virtual  terrain  of  the  objective 
function  (13)  that  drives  local  control,  we  observe  the 
convergence  of  ^(x(t))  as  depicted  in  Fig.  9(a).  For 
times  70  s  <  t  <  90  s,  convergence  of  ^(x(t))  slows, 
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Fig.  6.  Performance  of  the  local  control  law  (14)  demonstrating 
convergence  of  ^(x(t))  In  (a),  the  maintenance  of  expected  end-to-end 
rate  greater  than  the  threshold  of  a2,mm  =  O.l  In  (bh  and  the 
P[a/  >  cK,,min]  >  O.6.  The  envelope  surrounding  E[a2]  In  (b)  depicts  the 
60%  confidence  Interval  for  realizations  of  the  end-to-end  rate 
with  stochastic  /?,y(x). 


i/(a.(x),x)  as  depicted  in  Fig.  9(c)  shows  little  change, 
and  xi  and  X2  cease  progress.  Fortunately,  perturbations  in 
the  simulation  are  enough  to  pull  the  system  out  of  the 
local  minima  so  that  it  can  achieve  a  global  minimum  at 
t  =  140  s.  As  the  number  of  agents  increases,  the  fre- 
quency of  local  minima  in  — z^(a.(x),  x)  becomes  more  and 
more  of  an  issue  for  local  control.  The  addition  of  obstacles 
into  the  environment  adds  further  difficulties  as  it  not  only 
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Fig.  8.  Evaluation  of  i^(ot(x),x),  Vx  e  {x  =  (xi,X2,X3) :  Xi  <  X2  <  X3, 
X3  =  (5, 0),xi,  and  X2  online  connecting  access  point  and  X3}. 
Local  maxima  occur  at  xi=X2=  1.9  m  and  xx  =  IS  m,X2  =  3  m. 


affects  feasible  configurations  x  due  to  collision  con- 
straints, but  also  introduces  nonsmooth  components  in  the 
underlying  point-to-point  communication  links  i^ij(x). 

C.  Global  Planning 

Gradient-based  control  will  drive  the  system  towards 
local  minima  of  (13).  However,  as  we  consider  larger 
teams  with  more  complicated  network  topologies  and 
complex  environments  with  obstacles,  local  minima  be- 
come more  of  an  issue.  We  propose  that  a  global  search  of 
(6)  is  necessary  in  order  to  accomplish  the  high-level 
situational-awareness  tasks  we  are  interested  in. 

To  consider  global  search  of  (6),  we  redefine  the  prob- 
lem to  be  more  amenable  to  motion  planning  approaches 
from  the  robotics  literature.  Let  X  be  a  bounded  connected 
open  subset  of  R^^  that  represents  the  full  joint  state  space 
for  the  team  of  robots  where  Xq  is  the  initial  configuration 
of  the  team.  In  general,  the  goal  region  will  be  defined  as 
Xg  =  {x  :  ^(x)  <  ^min  +  In  the  telepresence  applica- 
tion where  a  lead  agent  must  visit  the  location  x^^g,  ^(x)  = 
11^^  ~  ^^,gll^  ^nd  Xg  =  {x  :  —  x^^gll  <  6}.  The  obstacle 
region  Xobs  contains  any  configuration  that  places  an 
individual  robot  on  a  physical  obstacle  and  the  infeasible 
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Fig.  7.  Snapshots  from  a  four  robot  trial.  The  end-to-end  rates  and  probability  of  meeting  the  problem  specifications  are  depicted  In  Fig.  9. 

^(x(f))  =  \\X3-  (9, 0)\\fort>50s  and  ^(x(t))  =  Ofort<50  s:  (a)  t  =  40  s;  (b)  t  =  75  s;  (c)  t  =  140  s. 
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Fig.  9.  (a)  Convergence  of  ^(x),  (b)  end-to-end  rates,  and 

(c)  probability  of  success  for  ttie  four  robot  trial  depicted  in  Fig,  7. 


region  represents  configurations  where  it  is  infeasible  to 
satisfy  the  network  constraint  (11) 


^inf  =  < 


X  :  ,  —  <  <l>  (e) 


/af(a,x) 


V  a  e  <!  a;;.  :  0  <  a^.  <  1 


•  (16) 


explore  the  workspace,  adding  states  in  a  tree  structure  T 
until  a  point  x  G  Xg  is  added  to  the  tree  T.  At  each  step 
of  the  RRT  algorithm,  we  pick  a  random  state  x  = 
RANDOMSTATE(X,  T)  and  select  the  point  x^^in  = 
NEAREST(T,  x)  that  is  closest  to  x  among  those  that 
have  already  been  added  to  T.  We  then  attempt  a  virtual 
drive  from  Xmin  to  x  using  the  subroutine  x  = 
EXTEND (xmin,  x).  The  point  x  is  the  first  intersection 
of  this  virtual  path  with  the  border  of  the  free  space  region 
Xfree,  or,  if  the  border  is  not  reached,  the  random  point  x. 
The  point  x  is  then  added  to  the  tree  T  as  a  branch  con- 
nected to  the  point  x^in  that  was  closest  to  x  among  the 
preexisting  elements  of  the  tree.  The  algorithm  terminates 
when  a  point  x  G  Xg  in  the  goal  configuration  is  added  to 
the  tree. 

A  common  problem  encountered  when  applying  RRT 
algorithms  to  high-dimensional  state  spaces  is  that  com- 
putation of  Nearest  is  inefficient  for  increasing  tree  sizes. 
We  adopt  the  strategy  of  storing  the  tree  T  in  a  KD-tree 
data  structure  which  stores  states  in  by  recursively 
subdividing  based  on  alternating  axis-aligned  hyperplanes 
[27].  This  enables  approximate  nearest  neighbor  calcula- 
tions that  maintain  performance  even  as  the  dimension 
increases.  However,  there  are  two  additional  difficulties 
that  arise  when  applying  standard  RRT  algorithms  to  solve 
the  specific  high-dimensional  network  connectivity  prob- 
lem in  (6):  1)  the  verification  of  feasible  states  as  Extend  is 
used  to  expand  the  tree  towards  x;  and  2)  the  prohibitive 
cost  of  uniformly  exploring  X^ee  for  our  high-dimensional 
problem  with  slow-to-compute  constraints.  We  discuss  this 
two  issues  in  the  following  two  sections. 

1)  Efficient  Verification  of  Feasible  States:  The 
EXTEND (xi,  X2)  algorithm  attempts  to  virtually  drive 
the  system  from  Xi  towards  X2  by  successively  verifying 
that  points  along  the  line  connecting  Xi  and  X2  are  in  Xf^-ee- 
It  returns  the  state  x  as  the  closest  state  to  X2  such  that  all 
states  sampled  with  precision  Ax  between  Xi  and  X2  are 
in  Xfree-  In  traditional  motion  planning  applications. 


The  free  space  Xf^ee  is  thenX  \  (Xobs  U  Xinf).  Finally,  a  path 
in  X  is  parameterized  by  a  scalar  s  >  0  and  given  by 
CT  :  [0,  s]  ^  X.  A  feasible  path,  and  solution  to  our  global- 
planning  problem,  is  then  cr  :  [0,s]  ^  Xf^ee  such  that 
cr(0)  =  Xinit  and  ct(s)  G  Xg. 

The  dimensionality  of  our  problem  and  the  high  com- 
putational cost  of  verifying  a  state  as  in  Xf^ee  makes  deter- 
ministic search  algorithms  impractical.  Instead  we  turn  to 
probabilistic  search  methods  that  offer  good  space  filling 
properties  and  efficient  exploration  of  an  unknown  space 
like  the  rapidly  exploring  random  tree  (RRT)  algorithms 
[26].  The  basic  structure  of  an  RRT,  as  depicted  in  Fig.  10, 
is  to  start  with  an  initial  point  Xq  and  expand  to  fully 


Fig.  10.  Graphical  depiction  of  the  RRT  search  process  visualized  in  R^. 
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verification  that  x  G  X^ee  is  based  on  an  algebraic  con- 
straint or  collision  query  with  a  multitude  of  efficient 
methods  for  doing  so  [28]-[30].  While  the  necessary  com- 
putation to  determine  x  0  Xobs  is  typically  small,  compu- 
tation of  X  ^  Xinf  requires  a  solution  of  the  SOCP  in 
Section  IV  and  can  be  costly  for  high-dimensional  systems. 

Consequently,  we  store  a.(x)  for  every  node  in  T  and 
recompute  a.(x)  only  when  necessary  to  extend  new 
states.  By  relying  on  the  fact  that  an  optimal  robust  routing 
solution  a.(x)  will  be  feasible  for  neighboring  states,  it  is 
often  possible  to  extend  x  towards  x  without  the  costly 
overhead  of  numerical  optimization. 

2)  Biased  Space  Sampling:  Random  states  x  are  chosen  to 
sample  the  space  X  C  R^^  according  to  a  probability  dis- 
tribution Px(x)  representing  the  belief  about  configuration 
X  being  part  of  a  feasible  path  cr(s).  If  nothing  is  known 
about  cr(s),  we  choose  px(x)  uniform  in  the  space  X.  In 
general,  at  least  the  final  configuration  is  known  in  that 
ct(s)  G  Xg.  We  can  then  bias  the  distribution  by  designing 
px(x)  to  choose  configurations  in  Xg  with  higher  proba- 
bility. Goal  biasing  improves  efficiency  of  RRT  algorithms 
by  reducing  the  number  of  samples  necessary  to  find  a 
feasible  path  cr(s)  in  the  high-dimensional  space  X  C  R^^. 

In  many  cases  of  interest,  however,  the  volume  of  Xg  is 
comparable  to  the  volume  of  X  and  goal  biasing  offers  little 
improvement  over  uniform  sampling.  In,  for  example,  the 
telepresence  application,  the  goal  position  of  the  leader  Xi^g 
is  known,  but  the  positions  of  the  remaining  robots  are 
free.  Thus,  goal  biasing  would  reduce  the  exploration  cost 
along  the  components  associated  with  X£  but  keep  the  cost 
of  exploring  the  remaining  2(N  —  l)  dimensions  fixed.  To 
further  reduce  exploration  cost  in  this  case  we  construct  a 
prediction  Xg  C  Xg  of  the  final  configuration  and  bias 
sampling  towards  this  prediction. 

Constructing  a  final  configuration  prediction  Xg  is  task 
specific.  We  describe  here  a  method  applicable  to  the  tele- 
presence application.  To  determine  the  configuration  pre- 
diction Xg  we  determine  configuration  predictions  Xj^g  for 
each  robot  and  compute  Xg  as  the  Cartesian  product  of 
these  individual  sets,  i.e.,  Xg  =  riili^i,g-  Notice  that  for  the 
lead  robot  we  can  makeX^^g  =  {xi  G  R  :  ||x^  —  Xi^g\\  <  S}. 

Observe  now  that  X  C  R^^  is  the  Cartesian  product 
X  =  Y[f=i  of  ^  decoupled  spaces  X^  G  R^  corre- 
sponding to  each  individual  robot.  If  we  further  assume  a 
homogeneous  team  of  robots,  then  all  robots  operate  in  the 
same  space  X^  =  Y,  with  a  common  set  of  physical  ob- 
stacles Yobs?  and  consequently  a  common  free  space  Yf^ee  = 
Y  \  Yobs-  It  follows  that  the  joint  free  space  Xf^ee  is  also  a 
Cartesian  product  of  N  identical  sets  Yf^ee  minus  those 
configurations  for  which  a  network  cannot  be  established 
with  sufficient  reliability 


Xfree  =  (^free)    \Xinf.  (17) 


While  infeasible  network  configurations  are  captured  by 
Xinf  as  given  in  (16),  Xf^ee  can  otherwise  be  described  by 
the  free  space  of  individual  robots. 

To  exploit  this  observation,  we  first  determine  an 
obstacle-free  path  7  :  [0,  s]  R^  such  that  7(0)  =  xq  is 
the  position  of  the  operating  center  and  7(5)  G  X^^g.  The 
obstacle-free  path  7  :  [0,  s]  Yf^-ee  is  split  into  N  —  1  equal 
length  segments  j^.  The  ith  robot  is  then  assigned  to  a 
segment  by  the  function  fe(i)  based  on  Euclidian  distance 
to  its  midpoint.  Segments  are  enlarged  to  define  the  region 
Xi  g  for  i  7^  0,  ^.  Since  this  is  a  heuristic  for  the  goal 
configuration,  the  only  requirement  on  Xi^g  is  that 
7)2(i)  :  [0,s]      Xi^g.  A  typical  choice  is 

Xi,g  =  |xi  :  min||xi  -  Jk{i){s)\\  <  4} 

where  dg  is  a  parameter  controlling  the  enlarged  size  of 
Xj^g.  The  predicted  final  configuration  is  then  computed  as 
the  Cartesian  product  Xg  =  ]^|L^Xi  g.  The  construction  of 
Xg  described  above  is  based  on  the  heuristic  that  a  feasible 
goal  configuration  in  an  environment  with  obstacles  will 
resemble  a  line-of-sight  communication  chain.  Increasing 
the  size  of  Xg  with  large  values  of  dg  limits  the  implication 
of  this  assumption. 

D.  Experimental  Results 

The  randomized  motion  planner  is  able  to  find 
feasible  configurations  that  allow  target  servicing  at 


Fig.  11.  The  global  planning  task  for  five  robots  In  a  complex 
environment.  We  depict  the  series  of  waypolnts  that  xs  must  visit  and 
the  Initial  conditions  for  The  Inset  Images  depict  our 

experimental  system  deployed  In  the  environment. 
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Fig.  12.  The  end-to-end  rates  of  the  nodes  during  the  global  planning 
experiment  depicted  In  Fig.  11,  In  each  plot,  the  shaded  envelope 
depicts  ij  and  the  variations  that  occur  with  probability  e  =  0.75  based 
on  ii  while  the  dashed  black  line  depicts  the  Instantaneous  rate  a  -,. 
Note  that  as  min  =  0.25  while  all  other  a/  min  =  O. 


positions  not  attainable  with  the  local  control  approach 
from  Section  V-A.  It  can  additionally  provide  a  feasible 
sequence  of  configurations  to  get  to  the  target  configura- 
tion. To  test  the  global  planner  on  a  system  with  one  fixed 
access  point  and  five  robots,  we  introduce  a  sequence  of 
task  potential  functions  ^i(x),  ^2(x), . . . ,  ^m(x)  that 


require  the  lead  node  X5  to  visit  a  sequence  of  positions 
while  the  remaining  four  robots  act  as  relays  to  support 
end-to-end  communication  with  the  access  point  of 
ci5,min  =  0.25.  We  require  that  this  end-to-end  rate  be 
satisfied  with  probability  e  =  0.75.  The  problem  is  made 
more  complicated  by  the  introduction  of  obstacles  that  not 
only  block  robot  motions  but  also  degrade  received  signal 
strength  in  (7)  by  7.6  dBm  when  line-of-sight  is  lost. 

The  global  planning  algorithm  described  in  Section  V-C 
is  queried  to  find  a  feasible  path  CTj  :  [0,  s]  R^^  for  each 
task  ^i(x)  in  order  such  that  (JiiO)  —  cri_i(s)  and  cri(O)  = 
Xinit-  The  trajectory  of  the  lead  node  X5  for  the  concate- 
nation of  paths  (T  =  (Ti|  . .  .  |(Tm  is  depicted  in  Fig.  11.  After 
solving  for  a  feasible  path,  x^^^(t)  is  computed  so  that  x(t) 
follows  (T. 

We  conduct  an  experimental  deployment  of  five  robots 
controlling  the  feasible  trajectory  a  for  the  problem  de- 
picted in  Fig.  11.  In  this  experiment,  each  robot  is 
equipped  with  a  2.4-GHz  Zigbee  radio  that  is  used  to 
broadcast  and  receive  packets  in  order  to  measure  in- 
stantaneous received  signal  strength  that  can  be  used  to 
estimate  the  supported  communication  rate  Aij(t)  between 
node  i  and  ]  at  time  t.  Using  these  measurements,  in  con- 
junction with  the  network  routing  solution  a.(t),  we  can 
estimate  the  actual  supported  end-to-end  rate  at  time  t  for 
each  node  i,  di(a.(t),  x(t)).  The  metric  for  the  performance 
of  our  approach  is  then  that  each  di(o(.(t), x(t))  >  cii,min 
with  probability  e  across  the  duration  of  the  trial. 

Fig.  12  depicts  the  predicted  di(a.(t), x(t))  and  mea- 
sured di(a.(t),  x(t))  end-to-end  rate  of  each  node  for  the 
duration  of  the  experiment.  First,  note  that  over  the  entire 
trajectory  produced  with  our  global  planning  methods,  the 
prediction  of  end-to-end  rates  always  satisfies  the  problem 
specification  that  P[ai(cv.,x)  >  Qi^min]  ^  Second,  when 
we  examine  the  actual  end-to-end  rates  based  on  real 
measurements  of  point-to-point  wireless  channels  we  see 
that  it  is  generally  true  that  di(a.(t),  x(t))  >  flj  min*  III  fact, 
if  we  analyze  this  constraint  across  the  duration  of  the 
experiment,  we  find  that  the  fraction  of  time  spent  below 


Fig.  13.  Snapshots  of  the  network  configuration  during  the  experimental  deployment  based  on  global  planning  to  satisfy  the  task  In  Fig,  11. 
The  end-to-end  rate  performance  Is  depicted  In  Fig.  12.  (a)  t  =  100  s;  (b)  t  =  278  s;  (c)  t  =  482  s;  (d)  t  =  622  s. 
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Fig.  14.  Running  time  for  a  benctimarl<  planning  tasli  computed  witti 
an  increasing  number  of  robots  N  and  different  end-to-end  rate 
tfiresfiolds  amin*  Since  the  global  planning  algorithm  is  randomized, 
each  point  is  the  average  running  time  over  ten  trials. 


the  minimum  threshold  for  each  of  the  instantaneous  rates 
ai,  a2,  as,  04,  and  as  is  9.2%,  0.8%,  0.3%,  0.6%,  and  2.9%, 
respectively. 

Representative  configurations  from  the  experiment  are 
depicted  in  Fig.  13.  In  Fig.  13,  at  t  =  100  s,  the  predicted 
goal  state  Xg  assumes  the  shortest  line-of-sight  path  which 
is  the  left  hallway,  i.e.,  a  similar  result  to  the  reactive 
methods  in  our  local  control  algorithm.  As  the  system 
transitions  to  Fig.  13(b),  where  the  lead  node  X5  has  been 
tasked  to  a  waypoint  in  the  right  hallway,  the  prediction  for 
Xg  shifts  to  a  chain  of  relays  going  through  the  right 
hallway.  This  shift  in  the  basic  topology  of  Xg  focuses  on 
exploration  of  the  joint  state  space  so  that  X4  moves 
towards  a  configuration  that  will  lovoer  the  performance  of 
the  network  over  the  short  term.  As  node  X5  completes  the 
desired  loop,  it  utilizes  X4  as  a  relay  channel  and  is  able  to 
maintain  the  desired  end-to-end  rate.  It  is  this  dramatic 
shift  in  network  topology  that  highlights  the  advantage  of 
our  global  planning  approach  as  we  are  able  to  accomplish 
continuous  end-to-end  rate  maintenance  that  would  not  be 
possible  with  a  purely  reactive  method. 

With  regards  to  the  running  time  of  the  randomized 
global  planning  algorithm,  we  note  that  it  is  difficult  to 
characterize  the  performance  of  randomized  search  algo- 
rithms. One  factor  is  the  complexity  of  Xf^-ee  which  is  de- 


termined both  by  physical  obstacles  in  the  environment  as 
well  as  the  constraints  placed  on  feasible  network  configu- 
rations, i.e.,  the  solution  to  a.(x).  Fig.  14  depicts  the  run- 
ning time  for  a  benchmark  environment  as  we  increase  the 
dimension  of  the  problem  N  by  adding  robots  to  the  sys- 
tem. We  also  vary  the  end-to-end  rate  threshold  Qmin  that 
must  be  maintained  for  the  lead  node  to  demonstrate  how 
more  strict  network  requirements  also  increase  complexity 
of  the  search  space. 

VI.  CONCLUSION 

We  propose  a  system  architecture  that  provides  end-to-end 
connectivity  for  autonomous  teams  of  robots  as  they  pur- 
sue operator-assigned  tasks.  This  architecture  is  composed 
of  a  cyber  component  that  determines  the  configuration  of 
the  wireless  network  and  a  physical  component  that  hand- 
les mobility.  Because  the  performance  of  point-to-point 
wireless  links  is  difficult  to  predict,  we  adopt  a  stochastic 
model  for  supported  rates  and  develop  optimal  robust  so- 
lutions to  the  wireless  routing  problem.  The  strengths  of 
the  proposed  approach  are  that:  1)  network  availability 
takes  into  account  specific  thresholds  on  the  end-to-end 
rates  of  individual  nodes;  2)  considering  the  stochastic 
nature  of  communication  channels  network  availability  is 
further  defined  in  probabilistic  terms;  3)  by  specifically 
searching  for  robust  solutions,  we  obtain  configurations 
with  spatial  diversity  and  increased  probability  of  success 
in  the  face  of  difficult  to  predict  communication  channels. 

Robust  routing  solutions  are  integrated  with  a  physical 
component  that  handles  mobility  either  through  local 
control  or  global  planning.  Local  controllers  converge  to 
solutions  that  are  local  minima  of  the  concurrent  mobility 
and  network  routing  problem.  The  global  planner  per- 
forms a  randomized  biased  search  in  the  space  of  spatial 
configurations  in  an  attempt  to  jointly  solve  the  mobility 
and  optimal  network  routing  problems.  We  perform  expe- 
riments to  demonstrate  that  the  global  planning  algorithm 
succeeds  in  navigating  a  complex  environment  while  en- 
suring that  end-to-end  communication  rates  meet  or  ex- 
ceed prescribed  values  within  a  target  failure  tolerance. 
Global  planners  are  able  to  discover  dramatic  shifts  in 
communication  topology  that  allow  for  continuous  commu- 
nication that  is  not  possible  with  purely  local  methods.  ■ 
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Information  Sciences  Directorate  (CISD).  He  received  his  Ph.D.  from  the  University  of  Pennsylvania  in  2011,  where  he  worked 
in  the  General  Robotics,  Automation,  Sensing  and  Perception  (GRASP)  Laboratory  on  communication-constrained  planning  for 
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and  Atmospheric  Sciences  Department.  He  is  a  Fellow  of  the  Geological  Society  of  America,  National  Speleological  Society, 
and  ARL.  Dr.  Harmon  has  authored  or  coauthored  more  than  200  peer-reviewed  publications  and  remains  actively  engaged 
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Directorate.  She  earned  a  B.A.  in  biology  from  Rice  University  and  a  Ph.D.  in  biology  from  UCLA;  her  dissertation  focused  on 
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University  of  Southern  California,  all  in  electrical  engineering.  Dr.  Swami  has  published  more  than  300  journal  and  conference 
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University  and  George  Washington  University.  His  awards  include  the  Military  Communications  Technical  Achievement  Award 
and  the  Best  Paper  Award  of  the  Institute  of  Electrical  and  Electronics  Engineers  Military  Communications  Conference.  In  1989, 
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